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PREFACE 



Numerous changes, in the form of additions, omissions, and altera- 
tions, have been made in this edition. Chapters I, V and VI remain 
with little alteration. Chapters III and VIII have been expanded and 
altered, while Chapters II, IV, VII, IX and X are entirely or almost 
entirely new. These changes are wholly the work of Professor Godfrey 
H. Thomson, and the merit of the very great improvement in the book 
is due to him. Owing to my own time during the War being taken up 
entirely with army medical work, I have been unable to take any part 
in the further development of correlational psychology since 1914, and 
it was therefore a very great relief to me when Professor Thomson 
kindly offered to cooperate in bringing out a second edition of Mental 
Measurement. No one could have been more fitted for this task, both 
on the mathematical and on the psychological side, and he has produced 
what is in half its extent a new book, vastly superior to the old. Some 
of the sections are purely mathematical, and these will perhaps appeal 
to a wider circle of readers. 

As this second edition, like the first, is opposed to the theories of 
Professor C. Spearman, I wish to take the present opportunity of saying 
that as an opponent I have learnt to respect and admire his work to 
a very great degree, and looking back dispassionately over six years 
spent in other forms of psychological research, I find myself more con- 
vinced than ever that his work in correlational psychology is epoch- 
making in its significance. Although I have found no occasion to 
withdraw any of my own earlier criticisms, I have gladly taken the 
opportunity of modifying the wording in many respects, in order to do 
better justice to the outstanding importance of his views. Whatever 
may be the ultimate verdict on the significance of "hierarchical order" 
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in groups of correlation coefficients, Professor Spearman's correlational 
work in other directions would alone justify his unique position in the 
domain of statistical psychology. 

It has seemed to me to be essential that Professor Thomson should 
contribute a separate preface, since so much of this edition is concerned 
with the vindication of a theory which originated from his brain alone. 
Should a third edition be called for later on, I hope to be able to add 
a "Part III," summarising results and theories in non-mathematical 
language. But at present the battle rages so hotly that this is not 
desirable, even if it were possible. 

W. B. 

EIinq's COLLEaE, 
London. 

August, 1920. 



II 

Dr Brown's suggestion that we should write separate prefaces to 
this book appeals to me for one reason only, that it enables me to 
shoulder, myself, the responsibility for any errors either in principles 
or in detail which this edition may contain. Any credit I am glad to 
share, but the faults are my own. 

The book is necessarily concerned largely with controversial matters. 
On one of these, the question of the validity of the reasoning which 
Professor Spearman has based on the occurrence of hierarchical order, 
I am entirely convinced that the situation is as described in Chapters IX 
and X. My position is that hierarchical order is the natural order among 
correlation coefficients, that it only expresses the well-known fact that 
correlation coefficients are themselves correlated, and that the degree 
of perfection of hierarchical order found among psychological correlation 
coefficients is merely that which occurs by chance, and not, as Professor 
Spearman has been led to believe, extraordinarily high. 

On this point, then, I feel certain that Professor Spearman has drawn 
over-hasty conclusions, conclusions which may be, by a fine instinct, 
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the true conclusions, but conclusions which are totally unsupported by 
the "hierarchy" argument. 

On a second point at issue, however, namely the question of the use 
of ranks instead of measurements in calculating correlations, I feel much 
less competent to speak. There is a good deal to be said on both sides. 
The crux of the question to my mind is this. Admittedly the form of 
distribution of any set of scores in a mental test is not necessarily 
identical with the distribution of the true scores in mental units. If 
we give up the actual scores and merely retain the order of merit we 
are, it would seem, only giving up a false claim to accuracy and re- 
stricting ourselves to what we are certain of. But when we come to 
perform the actual calcidations we find that we are again assuming a 
form of distribution of the mental quantity, usually a normal distri- 
bution, but sometimes some other, as in the "Foot-rule." It comes 
therefore to this, that in calculating rank-correlations we are refusing 
to take the distribution found and are substituting for it one not found. 
The question then is whether it is more probable that the assumed or 
the measured distribution corresponds to the actual unknown mental 
distribution. I think that each case would have to be taken on its 
merits: but I do feel that in the actual measurement the attempt 
should be made to approximate to the true distribution, by comparing 
differences carefully, as I suggest on pp. 11 — 12. 

However, I wish to make it clear that there is not, on this question 
of the use of ranks, that sharp cleavage of opinion between Professor 
Spearman and myself which cannot be disguised in the case of the 
General Factor argument. 

A third question is that of the correction of raw correlation coef- 
ficients, in which I share to a considerable extent Dr Brown's misgivings 
as to Professor Spearman's formula, and on which Dr Brown has, since 
the first edition of this book, added experimental evidence as to that 
correlation of errors and true values which the Spearman formula 
assumes to be zero (see pp. 160 — 3). 

My thanks are due to the President and Council of the Royal Society, 
and to the Editors of the British Jon/rnal of Psychology, the American 
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Journal of Psychology, the Psychological Review, and Biometrika, for 
their kind permission to reproduce long extracts from articles in the 
publications controlled by them: to Professor F. M. Urban and his 
publisher Mr W. Engelmann for permission to print Urban's Tables for 
the Constant Method: to Dr E. G. Boring of Clark University for valuable 
suggestions: to Dr G. J. Eich for his permission to print his checking 
table: to my colleague Dr G. R. Goldsbrough for reading some mathe- 
matical pages of the proofs : and to Professor Spearman for reading the 
proofs from p. 65 onwards, and the MS. of this preface. Acknowledg- 
ments to authors cited are made by footnotes, but should any omission 
in this or any other respect have occurred, my sincere apologies are 
tendered for the mistake. 

I have welcomed the opportunity which this book "has afforded of 
printing in one volume arguments which, after a delay due to four 
years' military service, I was compelled to publish piecemeal, during 
1919 and the earlier months of the present year, in various scattered 
papers: and I would like finally to thank Dr Brown himself for per- 
mitting his book to be thus mauled by another and will close as I began 
by begging that none of my crimes may be visited on his head. 



G. H. T. 



AEMSTRONa College, 

Newcastle-upon-Tyne. 
Angust, 1920. 
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PAETI 
PSYCHOPHYSICS 

CHAPTER I 

MENTAL MEASUEEMENT 

Equal appearing intervals — Just perceptible distances — The interpretation of 
Weber's Law — Indirect methods of measurement — ^The approach to measurement 
by means of grading magnitudes and their differences. 

(1) EQUAL APPEARING INTERVALS 

The pre-conditions* of measurement in any sphere of experience are 
(1) the homogeneity of the phenomena, or of any particular aspect of it, 
to be measured, (2) the possibility of fixing a unit in terms of which the 
measurement may be made, and of which the total magnitude may be 
regarded as a mere multiple or sub-multiple. These pre-requisites are 
satisfied in the cases of spatial and temporal magnitudes, in terms of 
which, directly or indirectly, all the measurements of the physical 
sciences are expressed. It was thought by Fechner that they are also 
satisfied in the case of the strictly psychical phenomena of sensation- 
intensity, i.e. it was assumed that any given sensation-intensity might 
be regarded as made up of a sum of unit sensation-intensities. This 
view has been definitely rejected by later psychologists. Every sensation- 
intensity is qualitatively distinct from every other sensation-intensity. 
No addition or subtraction of intensities is possible. " To introspection, 
our feeling of pink is surely not a portion of our feeling of scarlet; nor 
does the Ught of an electric arc seem to contain that of a tallow-candle 
in itself" (James)f. Fechner's mistake was due to a confusion of 

* These pre-conditions are those usually stated. But the idea of measurement has 
been so expanded during recent generations by the mathematical ideas of continuity, 
infinity and limit that they are becoming inadequate as a statement of the position. 
Compare the last section of the present chapter, on the approach to measurement by 
means of grading magnitudes and their differences, 
t Principles of Psychology, i. p. 646. 

B. &!• 
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sensation-intensities with the (physical) stimulus-values required to 
produce them. 

Nevertheless, purely psychical measurement is not entirely im- 
possible. Within any one series of sensation-intensities, e.g. a series of 
greys, the contrasts or " distances " separating different pairs of intensities 
are perfectly homogeneous with one another and can be measured in 
terms of one another or in terms of an arbitrarily chosen unit of "sense- 
distance." Given two brightness-intensities a and 6, it is quite possible 
to find, within limits of error, a brightness-intensity c which is as much 
higher than 6 in the scale of intensities as h is than c, i.e. such that the 
sense-distance he — the sense-distance ab; or, again, it is quite possible, 
theoretically, to find a brightness-intensity d which bisects the sense- 
distance ah, i.e. which is such that it is as far removed from a in the 
scale of intensities as 6 is from it — in symbols, ad = dh. Hence the 
"distance," or disparity, of h from a is twice that of i from a, the distance 
of c from a is four times that of A from a. If, now, ad, or the distance 
of d from a, be taken as a conventional unit, the values of ab and cui 
will be 2 and 4 respectively*. 



— »- a 

Scale of 
brightness-intensities 



Fig. 1 



A scale of intensities may in this way be formed rising by "equal- 
appearing intervals" or sense-distances, and the magnitude of any given 
interval may then, theoretically, be read off in terms of the unit-distance 
employed in the construction of the scale. In practice, however, it is 
found more convenient to fix the successive scale-marks, the successive 
members of the intensity-series, in terms of their corresponding stimulus- 
values. It has been found by experiment that in the case of light- 
intensities and sound-intensities the successive stimulus-values form, 
with fair approximation, a geometrical progression, or, in other words, 
each stimulus-value divided by the immediately preceding one gives 
approximately the same quotient. From the stimulus-values corre- 
sponding to an ascending series of eight equidistant brightness-values, 
Ebbinghaus obtained the following series of quotients: 

2-3 2-1 2-1 1-8 1-7 1-7 2-0 

* This view of mental measurement in terms of sense-distance fiist originated with 
J. R. L. Delboeuf, iJemte Philosophique, 1878, v. p. 53. His term for sense-distance was 
"contraste sensible. 
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The 'quotient value is not entirely constant, being slightly greater 
towards the two ends of the scale than it is at or about the middle. For 
this central region, then, the general relation of the sense-distance to 
the stimulus-values is given by the logarithmic formula : 

„ T , -y- 7 , stimulus at b 

bense-distance ab = k log —. = : — , 

° stimulus at a 

where the stimulus at a is one which gives any finite intensity of sensa- 
tion taken as the starting point or conventional zero (N.B. it is not 
necessarily liminal) ; the stimuli at a and b are those which correspond 
to the sensation-intensities of which ab is the "contraste sensible" or 
sense-difference. 

(2) JUST PERCEPTIBLE DISTANCES 

A mode of procedure which not only admits of much wider practical 
application than the above-mentioned "method of mean gradations," 
but also possesses a peculiar historical importance, is that which is 
concerned with the determination of the stimulus-increments corre- 
sponding to just-noticeable increments of sensation-intensity in different 
parts of the intensity scale. Weber found, in a series of experiments 
chiefly with lifted weights, that this stimulus-increment was relatively, 
not absolutely, constant for different regions of the intensity scale, i.e. 
that the stimulus corresponding to any original sensation-intensity had 
always to be increased by a constant proportion to arouse a just-noticeable 
increment of the sensation-intensity. If a 103 grams weight is just 
noticeably heavier than a 100 grams weight, then the weight just 
noticeably heavier than a 200 grams weight will be a 206 grams weight, 
not 203 grams. 

Mathematically formulated, Weber's Law is: 

8 (stimulus) ^ , 

— V: i — = a constant. 

stimulus 

The quantity S (stimulus)/stimulus — r§^ or -03 in our example of weight- 
lifting — is known as the "relative difference limen." It is of course the 
average value of a considerable number of determinations. 

Fechner verified Weber's Law in many different realms of sensation- 
intensity, and made it the basis of his own system of mental measure- 
ment. This he did by making the following three assumptions: 

(1) that a sensation-intensity is a measurable magnitude and may 
therefore be regarded as a sum of unit-intensities; 

(2) that just-noticeable differences of sensation-intensity are equal 

1—2 
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at different parts of the stimulus scale, and may therefore conveniently 
serve as the unit-intensities above-mentioned; 

(3) that the just-noticeable difference of sensation may be treated 
as a difference of two sensations, or at least that if Weber's Law applies 
to the former ("sensed difference") it will also apply to the latter 
("difference sensation"). 

On the basis of Weber's Law and these added assumptions, Fechner 
obtains the following formula, viz. 

, , ^. . i (stimulus) 

a (sensation) = c —^. — i — - , 

^ ' stmiulus 

which he calls tla.Q fundamerdal formula for mental measurement. Inte- 
grating, this becomes 

sensation = c log^ stimulus + C. 

Putting the stimulus in this equation equal to the stimulus T for which 
the sensation is just below the threshold of consciousness, i.e. = 0, we 

^^^' o = ciog,r + c. 

Subtracting the second from the first equation, 

, stimulus 
sensation = c log^ — = . 

Putting T = 1, and transferring to the ordinary logarithm system, we get 
sensation = k log stimulus. 

All the assumptions involved are questionable. The first one has 
already been considered at some length. It is not the single sensation- 
intensity which is measurable, but the distinctness, disparity or distance 
of one sensation-intensity from another. 

We must therefore regard the just-noticeable difference, not as a 
difference of two sensation-intensities but as a minimal sense-distance, 
if we are to be able to make use of it in our scheme of mental measure- 
ment. This modification, however, still leaves us involved in the diffi- 
culties of assumptions (2) and (3). 

Fechner's own reason for regarding all just-noticeable differences 
belonging to any one scale of intensities as equal was that they appear 
equal to introspection. Introspection in a case like this is obviously 
difficult, even for the most skilful observers, and its verdict cannot, 
therefore, be greatly relied upon. Theoretically it is quite conceivable 
that just-noticeable differences, though equivalent to one another as 
being all just-noticeable, i.e. as being sense-distances so small that the 
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sli^test diminution of them would cause ttem all, equally, to cease to 
be noticeable, yet as noticed or perceived would appear of different 
magnitude one from another. Ebbinghaus points to the analogous case 
of differentials or infinitesimals in mathematics. These are all equivalent 
to one another as being all equally negUgible as compared with finite 
magnitudes, yet are by no means necessarily equal to one another. If 
they belong to different "orders," those of a higher order are negligible 
as compared with those of a lower, etc. Again, "the least distances 
perceived as such at different parts of the skin or in direct and indirect 
vision -do not by any means all appear as equal magnitudes. On the 
contrary, so soon as they come to consciousness as distances they are 
at once perceived as distances of varying size, in a certain approximation 
to their objective differences*." In spite of these considerations, 
Ebbinghaus regards the correspondence of the stimulus results obtained 
for equal appearing intervals and for just perceptible intervals in the 
case of brightness-intensities (in the middle region of the scale both 
series of stimuli form a geometrical progression) as sufficient evidence 
for the approximate equality of the latter intervals. Mliller and Wundt 
had previously advanced the same argument. 

Several experimental investigations have been made with the express 
purpose of testing the relation of the methods of minimal change and 
mean gradations. Titchenerf sums up the theoretical basis of such 
experiments concisely as follows: "There are in reality two possible 
ways of working. (1) We might take a series of stimulus-values, corre- 
sponding to a series, say, of eight successive just-noticeable differences of 
sensation, and thereafter directly compare the two half-distances, of four 
just-noticeable differences each, and decide upon their equality or in- 
equality. Thiswould be a direct method of experiment." "Or(2)we might 
determine a few just-noticeable differences of sensation at different parts 
of the stimulus scale, in order to estabUsh the constancy of the relative 
difference Hmen, and thereafter work with supraliminal differences, and 
decide whether the same constancy holds. This would be an indirect 
method; it is the method indicated by the authors just cited [Miiller, 
Wundt, Kohler and Tannery]. ...Either of these two methods would, 
presumably, take us to our goal. The experimental work would be 
exceedingly difficult. Liminal determinations are always and intrinsi- 
cally difficult; and, further, the judgments passed upon just-noticeable 
differences and upon supraliminal differences are, even under the most 

* Ebbinghaus, Grundzilge der Psychologie, 2nd ed., 1905, p. ."524. 
f "Experimental Psychology," n. Instructor's Manual, p. Ixxvii. 
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favourable conditions, the expressions of radically different mental 
attitudes." 

It should be mentioned here that the principal rival to Feehner's 
hypothesis — the "difference hypothesis" — is that first formulated by 
Plateau, and generally known as the "quotient hypothesis." Plateau 
adopted the psychophysical formula 

sensation = c (stimulus)* 
on the basis of experiments by the method of mean gradations. This 
implies, in the place of Feehner's fundamental formula, the formula 
8 (sensation) _ 7 8 (stimulus) 
sensation stimulus ' 

in other words, it assumes that just-noticeable differences are relatively, 
not absolutely, equal sensation-magnitudes. Although Plateau himself 
withdrew his formula later, the " quotient hypothesis " still remains as 
the rival of Feehner's "difference hypothesis." 

To return to the experiments. Merkel (1888) worked with bright- 
nesses, pressures, and noises, and found that the stimulus corresponding 
to the sensation bisecting a supraliminal sense-distance was the arith- 
metical mean of the stimuli corresponding to the two terminal sensations, 
which would seem to support the quotient hypothesis, though such an 
inference is not entirely free from objection. Angell (1892) worked with 
noise-intensities by the method of mean gradations, avoided certain 
sources of error present in Merkel's form of procedure, and obtained 
results supporting the difference hypothesis, the stimulus of the bisecting 
sensation being the geometrical, not the arithmetical, mean of the 
terminal stimuli. 

A more thorough investigation was carried out by W. Ament* in 
1900. He used a series of Marbe greys for the brightnesses and employed 
the direct method; for the noise-intensities he used a Fechner sound 
pendulum and worked principally by the indirect method. The result 
reached supported the quotient hypothesis. But Ament's work has not 
escaped criticism. A repetition of his experiments on brightness- 
intensities by Frobesf has failed to confirm his results. Ebbinghaus 
also has found, in careful experiments with rotating sectors, that just- 
noticeable difierences in different parts of the scale of brightnesses are 
equal to one another. On the whole, therefore, the balance of evidence 
seems to be in favour of the "difference hypothesis." 

* W. Ament, "Ueber daa Verhaltnis der ebenmerkliohen zu den nebermerklichen 
Unterschieden bei Lioht- nnd Sohall-intensitaton," Phil. Stadien, 1900, xvi. pp. 135 ff. 
■f Zeitechrift far Psychologic, xxxvi. p. 344. 
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(3) THE INTERPRETATION OF WEBER'S LAW 

Fechner's third assumption — see above, p. 4 — brings us to the 
question of the interpretation of Weber's Law. 
There are three general forms of interpretation : 

(1) the psychophysical (Fechner), 

(2) the physiological (Miiller, Ebbinghaus, James, etc.), 

(3) the psychological (Wundt). 

According to (1), the logarithmic transition takes place in passing 
from the physiological changes in the sensory centres of the cerebral 
cortex to the corresponding sensation- intensities. The chief objection 
to this view of Fechner's is that Weber's Law is not exact. This same 
consideration supports (2), or the physiological view, according to which 
the transition occurs either at the inception of the stimulus in the sense 
organ, or somewhere between the nerve endings and their central con- 
nections in the sensory areas of the cortex. Experimental results 
obtained by Waller* and Steinachf are in favour of this view. Waller 
stimulated a frog's eye with Hght of different intensities, and found that 
the corresponding "negative variations" set up in the optic nerve varied 
in intensity as the logarithm of the stimulus-values (approx.). Steinach 
obtained similar results on stimulating the skin of a frog's thigh with 
weights and noting the negative variation in the attached nerve. If the 
negative variation may be assumed to be proportional to the intensity 
of the nerve-current passing along the nerve, these results point to the 
conclusion that the logarithmic transition occurs in the sense organ and 
its sensory nerve endings. 

EbbinghausJ has constructed a theory based upon the conception 
of varying degrees of dissooiability of complex molecules to account for 
the law and also for the deviations from it towards the two extremes of 
the intensity scale. 

The psychological view, (3), of the law, held by Wundt, regards it 
as a special case of the general psychological "law of relativity." 
Stimulus, physiological process, and pure sensation-intensity increase in 
simple proportion to one another. The logarithmic transition occurs in 
passing from mere sensation and sensation difference to apperceived 
sensation and apperceived sensation difference, and the intensities are 

* Waller, "Points relating to the Weber-Feohner Law," Brain, 1895, xvm. p. 200. 

t Steinach, " Elektromotorische Erscheinungen an Hautsinnesnerven bei adaquater 
Reizung," Pflilgefs Arch. 1896, Bd. 63, S. 495. 

J Ebbinghaus, "Ueber den Grund der Abweichungen von dem Weber' schen Gesetz 
bei Liohtempfindungen," Pflilger's Arch. 1889, Bd. 54, S. 113. 
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apperceived always in relation to one another. In addition to the 
objection that it regards the sensation-intensities themselves, not their 
distances from one another, as measurable magnitudes, this view is also 
open to the criticism that it furnishes no explanation of the widely 
varying size of the relative difference limen in different sense-depart- 
ments (e.g. DL for brightness-intensities = ^, DL for sound-intensities 
= About \) ; the physiological view sees in the varying structure of the 
different sense organs the adequate explanation of this. Moreover, the 
psychological view has no completely satisfactory explanation to give 
of the deviations from Weber's Law so frequently met with. 

These objections and difficulties make the view improbable, but by 
no means prove it to be impossible. It has the great merit of emphasising 
more definitely than was heretofore the case the importance of the more 
purely psychological factors in psychophysical experiments — in par- 
ticular, it brings into prominence the distinction between mere disparity 
of sensation-intensities present simultaneously or in immediate succession 
in the same consciousness, and the perception of this disparity, the 
discrimination of the intensities one from another. In psycho-physical 
experiments the subject's consciousness is not Umited to the mere 
sensational level. 

In this connection the distinction, explained in Chapter III of the 
present book, on page 75, between two essentially different measures 
of a subject's fineness of discrimination, is not without importance, for 
those two measures, as far as experimental work goes, are both beheved 
to obey Weber's Law. The one is the difference threshold spoken of in 
the present discussion, the other is there defined and named the inter- 
quartile range of the point of subjective equality : and the two quantities 
appear to differ in the level, sensational or perceptual, at which they 
stand. The extended idea of the probability of a judgment there defined 
throws this into prominence. 

Another fact which would seem to have a very immediate bearing 
on the question of the interpretation of Weber's Law is that plants in 
their response to the stimulus of gravity (Geotropism) appear to obey 
that law*. 

Although with continuous increase of stimulus-intensities the corre- 
sponding sensation-intensities rise in steps, each representing a just- 
noticeable difference, the psychophysical relation is really a strictly 

* H. Pitting, Jahrbuch f. wiea. Botanik, Bd. xli. 1905; F. Darwin, New Phytologist, 
1906; James Small, Annals of Botany, xxxi. April 1917; James Small, Proc. Soy. Soc B 
xo. 1918. 
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continuous one, as becomes at once obvious if we consider a special case. 
The sensation-intensity aroused in lifting a weight of 100 grams is 
"indistinguishable," as we say, from that aroused by 102 grams, the 
sensation aroused by 102 grams is "indistinguishable" from that aroused 
by 104 grams; yet the sensation aroused by 100 grams is perceptibly 
different from that aroused by 104 grams. Thus the sensation-intensity 
increases continuously, and the reason why this is not immediately 
apparent is probably to be looked for in the physiological mechanism 
of the psychophysical organism. The fact is that statements Uke the 
above as to two sensations being "distinguishable" or not lack precision 
in the absence of a definition of what is to be meant by distinguishable. 
To introspection almost all sensations are distinguishable from one 
another inasmuch as we seldom will agree that two are identical. More- 
over, although a man will not (in the case of a subject of average sensi- 
tivity) give a majority of answers heavier in comparing 102 grams with 

100 grams, yet the number he does give will, if the experiment is 
sufficiently carefully performed, be greater than he will give with 

101 grams as the comparison weight (100 still being the standard). 
Although therefore he does not give, either with 101 or with 102 grams, 
a majority of answers heavier in comparing them with a standard of 
100 grams, yet he does distinguish them from one another (if we take 
the result of a number of experiments), giving more heavier answers 
with the heavier weight* - 

Delboeuf held that the limen has no psychological importance 
whatever. If this is an extreme view, the importance which Fechner 
attributed to the limen is equally extreme in the other direction. 

The absolute or stimulus limen is similar in kind to the difference 
limen, since consciousness is never empty of sensation-intensities when 
such a limen is being determined. Here again, Fechner's rigid distinction 
of the two was a fundamental error. 

(4) INDIRECT METHODS OP MEASUREMENT 

The preceding account has probably sufficed to show that purely 
psychical measurement is a conceivable possibihty. Its practical appli- 
cation however has been more detailed than extensive. A more generally 
useful method in quantitative psychology is that which measures the 
external, physical or physiological, causes and effects of mental process. 
The measurements are madfe in terms of the physical units of space and 

* Compare Poinoar^ Science and Hypothesis, Scott, 1905, p. 22; and G. H. Thomson, 
British Association Reports, 1913, paper under sub-section I. 
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time, yet they are not merely physical measurements, since they derive 
all their significance from the correlated psychical processes. They are 
indirect psychical measurements*. Measurements of reaction-times, 
memory, fatigue, illusions, etc. are all of this nature. Their varieties 
are innumerable, and are illustrated by the accounts in any good text- 
book of experimental psychology (Sanford, Titchener, Myers). In all 
cases full introspective accounts are essential, and when correlated with 
the measurements make the latter essentially psychical measurements. 
Measurements of limina, referred to in the previous section, are of the 
same nature. They are of some special importance as being measures of 
sensory actiity, etc. — aspects of the total mental ability of psycho- 
physical organisms. They figure prominently in many researches based 
on the use of "mental tests." 

A method which makes a partial return to the more purely psychical 
form of measurement in terms of "distance" is the method of ranks or 
grades. Suppose we are considering the relative abihties of, say, 100 boys 
in EngHsh Composition. We should find it difficult to mark their essays 
individually in terms of any constant unit but might find it possible to 
arrange them in order of merit, especially if we had sufficient time at 
our disposal to employ the method of "paired comparisons." According 
to the procedure of this latter method, the essays would be taken in 
pairs, quite at random, and the better essay of each pair would be given 
a "preference mark." This procedure would be repeated again and again 
until every essay had been compared with every other essay. The order 
of merit is then given by the nimiber of preferences attaching to each 
essay. In this order, however, we cannot assume that the "abiUty- 
distance" from one boy to the next is a constant quantity. The boys 
near the extreme ends of the series will be farther removed from one 
another than the boys near the middle. We could only adjust for this 
if we knew the law of frequency-distribution for this kind of abihty in 
this particular species of boy, and theoretically the determination of 
this distribution depends upon a prior fixing of the psychological unit, 
the unit of "abihty-distance"; so that strictly the problem is insoluble. 
Since however under certain definite and indefinite conditions the form 
of distribution in a large number of biological and other cases of 
"physical" measurement has been found to be either Gaussian (normal) 
or differing from normal in ways described by Pearson's family of 
frequency curves, we might, with some probability of being near the 
truth, assume the normal form of distribution in the given case and so 
* See Ebbinghaus, Grundzilge der Psychologie, 1905, pp. 75, 76. 
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obtain a quantitative measure for the abiKty of each particular boy*. 
A direct psychological determination and application of the (conven- 
tional) unit-distance is not, perhaps, an entirely impossible problem, 
and work in this direction may be expected and if achieved would 
certainly be much more scientific and psychological than the present 
method of measuring in terms of the external quantum of work done. 

Finally, the interrelations of different mental abilities within any 
well-defined group of individuals situated within any definite environ- 
ment may be determined by means of the technical method of " correla- 
tion." A correlation coefficient or other similar constant (e.g. correlation 
ratio) measures the tendency towards concomitant variation of two 
mental or other abilities within a group of individuals. The result may 
be transferred to any single individual within the group as measuring the 
degree of probability of connection of the two abilities in the particular 
case. The correlation between two abiUties may be due to an actual direct 
relation of the abilities to one another, or, indirectly, to the influence of a 
common external environment upon them both. The first of these two 
cases is perhaps the more important, but the possibility of the second 
should not be lost sight of, and it also has a special interest of its own. 
The problems of correlation will be considered more fully in a later chapter. 

(5) THE APPROACH TO MEASUREMENT BY MEANS OP GRADING 
MAGNITUDES AND THEIR DIFFERENCES 

Since the pubhcation of the first edition of this book an important 
symposium bearing on the question of mental measurement has been 
held (in 1913). The exact problem submitted to the joint meeting of 
the Mind Association, the Aristotelian Society, and the British Psycho- 
logical Society, was "Are the Intensity-differences of Sensation Quanti- 
tative?" Although many of the arguments of that discussionf are 
beyond the province of this book, it is of interest to note that there was 
a general consensus of opinion that sensation-intensities, and their 
differences, are at least "magnitudes" which can be graded, even if 
they be not "quantities" which can be measured. And following out 
further a suggestion contained in a quotation from Mr Bertrand Russell 
made by Professor Dawes Hicks in his contribution to the symposium, 
it may here be pointed out that grading leads, if the differences can 
also be graded, to something almost indistinguishable from, if indeed it 

* This was done, e.g. by Professor Pearson in his paper in Biometrika, 1907, v. p. 105, 
and the example has been followed with success by other workers. 

t The papers are published in the British Jawm. of Psychol. 1913, vi. pp. 137 — 189. 
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be not identical with, true measurement. For if we can arrange in order of 
magnitude a,b,c, ... and also their first differences a — b,b — c,c — d, ... 
and the differences of these differences in turn, and so on, we can space 
out the original quantities a, b, c, ... as accurately as though we used 
a unit and measuxed them. To take a simple example, suppose five 
quantities a, b, c, d, e have really the measures 10, 16, 20, 31, 32. If an 
observer, ignorant of these measures, only knows the order of grading 
a, b, c, d, e of the quantities, he has already made a considerable advance 
even although he does not know the spacing, or the distances apart. 
If however he further can grade the differences, that is, if he knows that 
the greatest difference is that between d and c, and that the others 
follow in the order b — a, c — b, e — d, he has advanced further towards 
measurement, in the sense of accurate spacing, although there are still 
many spacings that will satisfy these gradings. Thus far he can almost 
always go in mental phenomena. And although it is practically difficult, 
there does not seem any theoretical difficulty about taking the next 
step, and grading the differences of the second order. In our example 
this grading is ^d - c) - {b - a) or a, say, 

(c - 6) -{e-d) or j8, 
(6 — a) — (c — 6) or y: 
and the order of the third differences is 

(«-i3), (iS-y). 
If now we could have all these gradings we could space out the original 
quantities very closely indeed to their true positions. This can be best 
seen by attempting to alter some one of the values while leaving all 
these gradings unaltered. Make d, for example, 29 instead of 31 and 
although the order a, b, c, d, e is unchanged, and also the order of the 
first differences, that of the second differences is completely altered. 

With an infinite number of quantities, and all the gradings of all 
their differences, we should, it would seem, arrive at an exact solution 
of the problem, so that grading and measurement are not perhaps so 
different in their nature as might at first be thought. 

Indeed a case could well be made out for the thesis that the 
theoretical objections sometimes brought against mental measurement 
really hold in the last resort against all measurement, and prove too 
much: and that the real difference between mental measurement and 
physical measurement is simply that mental phenomena, being practi- 
cally more difficult to handle, force on our notice the epistemological 
difficulties inherent in all measurement, whereas in physical measurement 
familiarity has bred contempt. 



CHAPTER II 

THE ELEMENTAEY THEOEY OF PROBABILITY 

Some statistical terms — Arithmetical short-cuts — ^Measures of scatter — The funda- 
mental theorem in probability — The binomial expansion — The normal curve of error 
— ^Fitting a normal curve to distribution data — ^The method of least squares. 

(1) SOME STATISTICAL TERMS 

The theory of probability was developed chiefly from two classes of 
material, (a) games of chance such as coin or dice throwing, roulette, 
etc., and (6) statistics, as they are called, that is such collections of 
quantitative information as the census, trade returns, insurance data and 
the like. It i» easy to see that psychological experiments frequently 
resemble both of these classes. Any experiment, the result of which 
depends upon a human decision, has much in common with the throw 
of a die. In both cases it often seems mere chance what the result is, 
although we beUeve that in both cases this is due only to our ignorance 
of the numerous factors at work. And, since this is so, any scientific 
experiment on the actions and reactions of human beings must neces- 
sarily be repeated many times, until there accumulates a mass of 
quantitative information similar in many respects to a census return. 

The mass of quantitative information thus accumulated is found 
upon examination to have certain pecuharities or properties. For 
example consider the following case: — The experiments, carried out by 
Professor Urban in 1906 — 7, were on Hfted weights. A standard weight 
of 100 grams was compared, by hfting it, with weights of 84, 88, 92, 
96, 100, 104 and 108 grams. The standard weight was always lifted 
first, and as the second unknown weight was lifted the judgment lighter, 
equal or heavier, was given*. 

Suppose the following answers were obtained on one occasion: 
108 grams, answer heavier 



104 




»» 


equal 


100 




>9 


heavier 


96 




99 


lighter 


92 




>l 


equal 


88 




>» 


lighter 


84 




9> 


lighter 



* See "Die pBychophysischen Massmethoden als Gmndlagen empirischer Messungen," 
by F. M. Urban, Archivfur die gesamte Psychologie, 1909, xv. p. 261. 
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In this series the lowest answer heavier is at 100 grams. Let this experi- 
ment be repeated 400 times, and in every series let the position of the 
lowest answer heavier be recorded. In a particular case the distribution 
of these just perceptibly heavier points was as follows: 



Grams 


... 84 


88 


92 


96 


100 


104 


108 


Frequency ... 


1 


8 


36 


85 


143 


119 


8 



These particulars are shown in graphic form in the adjoining figure 
where the points have been joined by straight lines to make a polygon, 
which shows at once the chief peculiarities of such a collection of data, 




84 88 92 96 100 104 lOSgrams 

SHg. 2, A cocked hat curve (Urban's Subject III) 

namely that the points are not scattered anyhow, but occur most fre- 
quently at a central value from which the frequency falls off in both 
directions. Such a figure as this is colloquially termed a cocked hat. The 
point where the summit occurs, or point of greatest frequency, is called 
the mode. Here it is apparently at 100 grams, but if other weights 
between 100 and 104 grams could have been examined, it might be found 
to be elsewhere. The figure looks as though a smooth curve through the 
points would bring the mode somewhere a little higher than 100 grams. 
The middlemost of the 400 positions of the just perceptibly heavier 
point is called the median. It too is in the 100 gram group, though here 
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again its exact position is uncertain. A better known central value (and 
one whicli can be more exactly calculated) is the mean or average value, 
found thus : 

Ix 84= 84 

8x 88= 704 

36 X 92= 3312 

85 X 96= 8160 

143x100 = 14300 

119x104 = 12376 

8x108= 864 



Divide by 400 ) 39800 



99-5 grams 



In the above cases the only possible values of the sought for points 

were, by the nature of the experiments, at certain definite values. In 

another class of experiment, however, all values within the range are 

possible as in the following case. Twenty-nine experiments were made 

of bisecting a line by eye*. The lengths of the left-hand half of the line 

on these 29 occasions are given in this table, arranged in order of 

magnitude: 

63-8 mms. 

62-2 

62-1 

61-4 

61-3 

61-2 

61-2 upper Quartile 



611 




610 




60-9 




60-8 




60-6 




60-4 




60-3 




600 


Median 


59-9 




59-9 




59-6 




69-5 




59-2 




59-2 




591 


lower Quartile 



590 
58-8 
68-7 
58-6 
58-2 
681 
57-6 



29) 1743-7 

60-13 Mean. 

* Aii'"experimentXper£ormed for the purpose of illustrating this chapter. Subject, 
O. H. Thomson. 
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Here the median by counting is 60 mms. and the mean on calculation is 

found to be 60-13 mms. In this case the "cocked hat" is not at first 

sight evident, but if the data are grouped in some way it comes to light. 

,For example, they can be arranged thus: 

One reading from 63 to 63-9 mms. incluBive 

Two readings „ 62 „ 62'9 „ 

Six „ „ 61 „ 61-9 „ 

Six „ „ 60 „ 60-9 „ 

Eight „ „ 59 ,, 59-9 „ 

Kve „ „ 58 „ 58-9 „ 

One leading „ 57 „ 57-9 „ 

Here the numbers 1, 2, 6, 6, 8, 5, 1 show clearly the concentration in 
the middle, although being smaller they are not so regular as in the 
previous "cocked hat." Instead of concentrating these at points it 
seems more accurate to construct a diagram such as that here given 



S 



8 

7 


















- 






6 
5 
4 


















- 












3 


- 












2 

1 





































57 58 59 60 61 62 63 64 

Millimetres 

Fig. 3. A histogram formed from the bisection data 



where a rectangle of the proper height is constructed over the corre- 
sponding base. Such a figure is called a "histogram." 

This example is convenient for explaining two new terms, namely 
the "quartiles" and "semi-interquartile range." The "quartiles" are 
the readings which are one-quarter distant from each end, just as the 
"median" is halt- way. In the table on page 15 there are 29 readings. 



The haK-way reading is 



29+1 



or 15 from either end, and the quartiles 



29 4- 1 
are at the readings — j — or 7J from each end, for which we can take 

the mean of the seventh and eighth readings at 59-05 for the lower 
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quartile and 61* 15 for the upper quartile. The semi-interquartile range 
is half the distance between the quartiles. It is, in this case, 

61-15 - 59-05 , _^ 

2 = ^"^^- 

Clearly the semi-interquartile range is a crude measure of the scatter 
of the readings. When it is large the readings are more scattered, and 
therefore, any one of them is less likely to be correct, than if the semi- 
interquartile range had been small. 

The mean value is sometimes called the expectation. This term arises 
from games of chance. For example, suppose I am playing against an 
opponent at a game of dice, in which my opponent has to give me as 
many shilUngs as there are pips in a single throw of the die, then the sum 
which I ought to give him before each throw, in order to make the game 
an even one, is three shillings and sixpence, which is my expectation of 
gain. It is not the most probable sum for my opponent to give me ; indeed 
he will never give me this exact sum, since he always pays me in shilhngs 
and not in pence. But at the end of a sufficiently large number of throws 
I shall have received approximately this sum per throw on the average. 
From another point of view the mean will be found to correspond 
to a centre of gravity, 
namely the centroid of the 
curve of distribution of the ^ 
readings. Let the adjoining | 
figure represent such a curve, 
the number of readings 
which occur within a portion 
AB of the range being represented by the area included between the 
ordinates at A and B. Then the whole area of the curve will represent 
N the total number of readings. That is 




ISl = ydx 



where the integration is over the whole curve. Each reading is repre- 
sented by its position on the cc-axis, and the sum of all the readings is 



The mean is therefore given by 

\xydx 

jydx 



xydx. 



or 



^jxyda 



B. &Ta 
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But this is the expression for the abscissa of the centre of gravity of 
the curve, through which the ordinate above the mean therefore passes. 

(2) SOME ARITHMETICAL SHORT-CUTS 
There are several devices which considerably shorten the arithmetical 
labour of finding the mean. These can be illustrated by the example 
on page 15, near the top. 

Choosing a convenient origin. The occurrence of large numbers in the 
calculations is minimised if an origin is chosen within the actual range 
of distribution, preferably either at one end or in the middle. In the 
example we have in mind, the point 100 grams might be selected, as the 
large number 143 is thus avoided. The calculations would then appear 
as follows: 



Ix 


-16 




-16 


8x 


-12 




-96 


36 X 


- 8 




-288 


85 X 


- 4 




-340 
-740 


119 X 


4 




476 


8x 


8 




64 

540 

-740 






400 )j 


-200 
-0-5 






ongis 


I 100 
+ 99-5 Mean 



Choosing a convenient unit. Since in this particular case the measure- 
ments are all made at intervals of 4 grams, it is a considerable simplifica- 
tion if this is chosen as the unit. The same calculation then appears as 
follows: 



Ix -4 


-4 


8x -3 


-24 


36 X -2 


-72 


85 X -1 


-85 




-185 


119 X 1 


119 


8x 2 


16 




136 




-185 




400 ) - 50 




-•125 




4 grams working unit 




- -5 


origin 100 




+99-6 Mean 
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The Summation Method of finding the Mean 

This is only applicable in cases like the present where the readings 
are concentrated at a number of equidistant points. The calculation then 
appears as follows : 

84 grams 1 400 



88 „ 


8 


399 


92 „ 


36 


391 


96 „ 


85 


355 


100 „ 


143 


270 


104 „ 


119 


127 


108 „ 


8 


8 




400 


)1950 
4-875 

4 grams per unit 
19-5 
origin 80 

99-5 Mean 



The figures in the second column are the continued sum, from below 
upwards, of the figures in the first column. They are then themselves 
added up, the total being 1950. Now it is clear from its method of 
formation that this total is composed as follows: 

7x8-f6x 119 + 5x 143 + 4x 85 + 3x 36 + 2x8 + 1x1. 
That is to say, it corresponds to taking the origin at 80 grams, and a 
unit of 4 grams. 

It is instructive to try this calculation from the other end, as it were, 
making the continued sum from above downwards. The reader should 
try other modifications of the idea underlying this summation method. 
For example, how should it be carried out in order to place the origin 
at 84 grams? 

(3) MEASURES OF SCATTER 

Let us consider again the experiment of bisecting a line, described 
on p. 15. Since the hues bisected measured 126 mms., the true half 
lay at 63 mms. The mean of the trials was 60*13 mms. so that an error 
of 2'87 mms. was made. This error does not in itself however give a 
complete description of the subject's performance; a measure of scatter 
is required. One such is the semi-interquartile range. When we say that 
the semi-interquartile range is 1-05 mms. we mean that half the attempts 
at bisection lay within a range of 2-1 mms., and, since half the trials 
already made come within this range, the probability of the next trial 
also coming within it is one-half, apart from practice improvement. 

2—2 
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We must, however, next consider other and more usual ways of 
indicating scatter, although the use of the semi-interquartile range as 
a check is most valuable, since it is a quantity about which glaring errors 
are not likely to be made. 

(1) There is first the "mean variation," a measure much in use 
among psychologists but not one to be recommended. In this the 
deviation or variation of each reading from the mean is written down, 
and the mean of these found, disregarding their sign. Taking the same 
example of bisecting a hne we obtain 

Sum of deviations regardless of sign , ^ . 

M.v. = TfT — r T-^ ^- = 1-14 mms. 

dumber oi cases n 

The mean variation is often, as here, about the same size as, or a little 
larger than, the semi-interquartile range, which latter, it will be seen, 
is indeed the median variation, from the median, though this name is 
not used. 

(2) The "standard deviation," generally denoted by a, is by far 
the best measure of scatter to use, for reasons which will gradually 
become clear in the process of studying the subject. Its long title i& 
the "root mean square deviation" and it is obtained by squaring each 
deviation, adding the squares all together, dividing by the number of 
readings and taking the square root : 

/Sum of squares of deviations 



"V^ 



Number of cases n 



If this process be carried out the value a = 1*38 will be found for our 
example. 

This is admittedly a longer process than finding the mean variation. 
It can be simplified by the following device, which should be used 
whenever the mean of the readings is a number involving awkward 
decimals. Instead of taking the deviations from the real mean, here 
60'13 mms., let them be taken from some convenient point, here say 
60 mms., chosen by mere inspection. Proceed now exactly as before, 
squaring and adding the deviations. But from the mean of the squared 
deviations must be deducted the square of the distance between the 
real mean and the convenient point from which the deviations have 
been taken (see example opposite). 

The proof that this then gives the mean of the squares of the real 
deviations is easily obtainable by elementary algebra. 

This shows one very important property of the mean, namely, that 
it is the point where the sum of the squares of the deviations is a 
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minimum: for the provisional mean square is always greater than the 
real mean square, since the correction subtracted is essentially positive. 

Bisection data (page 15) 

Deviations from 60 mms. Squares of preceding 

3-8 14-44 

2-2 4-84 

2-1 4-41 

1-4 1-96 

1-3 1-69 

1-2 1-44 

1-2 1-44 

11 1-21 

1-0 1-00 

09 0-81 
0-8 0-64 
0-6 0-36 
0-4 016 
0-3 0-09 

00 000 

01 001 
01 001 
0-4 016 
0-5 0-25 
0-8 0-64 
0-8 0-64 
0-9 0-81 

10 1-00 
1-2 1-44 
1-3 1-69 
1-4 1-96 
1-8 3-24 
1-9 3-61 
2-4 S-76 

29) 65-71 

1-92 provisional mean square 
Subtract 013' =0-02 

1-90 real mean square 
Vl-90 = l-38=(7. 

In this section we have, throughout, taken the mean as the central 
value from which the scatter was to be measured. In the great majority 
of cases this is the practical plan ; but if necessary, measures of scatter 
from some other central value could be used. Indeed as has been 
pointed out, the semi-interquartile range is the median of the deviations 
from the median. 

If the distribution of readings is represented by a smooth curve as 
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in the diagram on p. 17, then it will be seen that the mean square 
deviation about the origin is given by the expression 



j^jx^ydx 



so that we have the equation, symbolising the above calculation, 

o^ = ^ I xh/dx — Mean^, 

the integrals being as before over the whole of the distribution. 

In practical work, instead of actually giving the standard deviation, 
it is more usual to quote a quantity called the "probable error," which 
may, for the present, be arbitrarily defined as equal to -67449(7, i.e. it is 
an arbitrary reduction of a. To the meaning of this reduction we shall 
return presently. 

The Standard Deviation about the True Value 

In the section above we defined a quantity known as the Standard 
Deviation about the Mean. From another point of view, however, we 
sometimes require the value of the Standard Deviation about the True 
Value of the quantity which is being measured. This will always be a 
Uttle larger than the former quantity, unless the true value happens to 
coincide exactly with the Mean. This follows from the theorem that 
the sum of the squares of the deviations is a minimum at the Mean. 

Of, course if we do not know the true value of what we are measuring, 
we shall be unable to find this quantity exactly, but it can be shown 
that it is approximated to if we divide by w — 1 instead of n after 
finding the sum of the squares of the deviations. 

Let a be standard deviation about mean a, and a' standard deviation 
about true value A. 

Let «!, fflg, ag, ... a„ be the readings. 

Let Ci, 62, 63, ... e„ be the true errors of those readings, and let e be 
the error of the mean a. 

Then a - ai = A + e - (A + e^) = e - e^^, 

a — ttj = e — eg, etc. 
Therefore 

^ ^ S(a- ffl,)^ _ S{e- e^Y ^ ne^ - 2eS je^) + S (e^^) 



n 



Now wa'2 = S (ej,2) 

and ne = S (gj.). 
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so that we have 

The last term is approximately zero, the positive errors cancelhng the 
negative. 

We have finally 




8 (e.^) ,2 . 
= cr — ■ 



••.-7' 



S (deviation*) 



n- 1 

It will be seen that when n is sufficiently large the two quantities 
become practically identical. It is when n is small that the correction 
becomes of importance. This is especially seen from a consideration of 
an extreme case, namely where only one measurement is made. 

Let us say that the one measurement made has the value v. Then 
the mean has also the value v, the deviation is zero, and the standard 
deviation is therefore the square root of zero divided by unity. That is, 
the standard deviation about the mean is zero. The standard deviation 
about the true value is, however, by the above rule, indeterminate, 
being the square root of zero divided by zero. 

The reciprocal of the standard deviation is frequently used as a 
measure of the accuracy of a set of observations. It is clear from the 
above that if the number of observations is small it is the standard 
deviation about the true value which must be used, that is we must 
employ w — 1 in the denominator instead of n. For otherwise the 
accuracy of a single measurement would be infinite. 

The Standard Deviation of the Arithmetical Mean 

In a section above it was shown that the mean square of the devia- 
tions of a set of readings is a minimum at the arithmetical mean. About 
any other value distant e from the mean this mean square is increased 
by e*. 

Now the mean square deviation about the mean is a^. But the mean 
square deviation about the true value is na^/{n — 1). Therefore the 
expected square deviation of the mean from the true value is given by 

g2 = -o^= . 

n— 1 n— I 
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The expected square deviation (before experience) is, however, the same 
thing as the mean square deviation (after experience), so that the 
standard deviation of the mean about the true value is therefore 



V(n - 1) 

and that about the mean of the means will be 

cr/-\/n. 

The standard deviation of a mean is therefore obtained by dividing the 
standard deviation of the whole distribution by the square root of the 
number of readings. 

The Standard Deviation of Sum or Difference 

Let a: be a quantity whose standard deviation is a^, and y a quantity 
whose standard deviation is Vyi required the standard deviation of the 
sum x + y. Let m^ be the mean value of x and m„ the mean value of y. 
Then any single value of the sum x + y will be of the form 

mx + S^ + my + Sy. 

Moreover the mean of the sum aj + ^ is equal to m^ + my, so that the 
deviation of the above reading is 

The mean square deviation of the sum x + y is therefore 

o^xH, = S{8^ + 8„)> = S (S^^)/n + S {S^8y)/n + S M/n. 
The quantity S {S^ S„) however will be very small if not zero, because 
the factors S^, and S^ are not connected with one another in any way, 
and their products are as likely to be positive as negative, so that the 
positive values will annul the negative on the average. We have, 
therefore, finally: 

ruy%^=S(S/) + Si8y^) 
= na/ + noy^. 

Exactly the same reasoning holds for the value of a^_y, the only 
change being in the sign of my and 8„, and therefore of S (Sj,S„), which 
is however zero. The final result is the same 

ffx-,, = V{<rJ^ + V)- 
It is assumed in both these formulae that x and y are independent and 
uncorrelated. 
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(4) THE FUNDAMENTAL THEOREM IN PROBABILITY 

In ordinary usage, when we say that an event is probable under 
certain circumstances, we mean that it is more likely to happen than 
not to happen, and by improbable we mean that it is more likely not to 
happen than to happen. 

If we cross-question ourselves as to why we think an event is probable 
we often find that it is because we have more frequently found it to 
happen than to fail under similar circumstances in the past. 

If we wish to apply mathematical treatment to probability we must 
decide on a quantitative measure for it. We do so by using a fraction 
(vulgar or decimal) for this purpose, in such a way that the fraction 
rises and falls with the probability, becoming unity for "certain to 
happen" and zero for "certain not to happen." The numerator of this 
fraction is the number of equally probable ways in which the event can 
happen, while the denominator is the total number of equally probable 
ways in which the event can, under the given circumstances, either 
happen or fail*. 

Thus, for example, consider the probability that with one throw of 
a six-faced die a score of more than four will be obtained. This can 
happen in two ways, namely, by throwing a five or a six. The total 
number of possible throws is six, and therefore the probability is f or ^. 

This method of giving a quantitative value to a probability is clearly 
connected with the method adopted in betting. For instance, odds of 
"3 to 1 against" a certain event means that the speaker judges that 
there is only one chance of success to three chances of failure. The 
fraction representing the probability of success is therefore 

1 1 

3 + 1 4" 
If the experiment with dice mentioned above be actually performed 
a large number of times, it will be found that the number of occasions 
on which the score exceeds 4 will closely approximate to one-third of 
the whole. If therefore we had never seen dice and had no idea of their 
appearance, but were told that a large number of throws always included 
about one-third which were over fourf, we should conclude that the 
probability of obtaining a throw of more than four was about one-third. 

* Cf. however Jeffries and Wrinoh, Phil. Mag. 1919. 

t The late Professor Weldon found 106602/(12 x 26306) =0-3377. See Phil. Mag. 
June 1900, article by Professor K. Pearson. 
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This method of finding probabilities, by deducing them from a large 
number of actual experiments, is that followed most frequently in 
practice. 

For example if the points of an aesthesiometer are applied to a 
subject's forearm, with the points distant 3 cms. from one another, 
the average subject will usually recognise that two points are present. 
Occasionally however he will only feel one point. What is the proba- 
bihty that he will answer two? This can only be decided by experiment. 
In an actual case, 150 trials were made at a number of different sittings. 
On 105 occasions the answer two was returned. If the conditions have 
been the same throughout then the probability of an answer two 
under these conditions is {^ = 0-7. 

If the probability of an event occurring is p then the probabiUty of 
it not occurring is clearly 1 — p, for which q is often written. For the 
event is certain to happen or not to happen, and therefore the two 
probabilities must add up to " certainty " that is to unity. Thus in the 
above case the probability of an answer two not being returned is O-S. 
Similarly the chance of throwing an ace at dice is \ and of not throwing 
an ace is f. The chance of obtaining a head on throwing a coin is |, 
and the chance of not obtaining a head is |. 

Next consider the probability of an event happening twice in suc- 
cession, if its probability is p for one occurrence. Take a few specific 
cases first. If two coins are thrown or one coin thrown twice, there are 
four equally likely things than can happen, namely: 



bead 


head 


tail 


tail 


head 


tail 


tail 


head 



The probability of getting two heads is therefore J which, it will be 
noticed, is equal to (\Y. 

Next consider two throws of a six-faced die. There are here no less 
than 36 things which can happen, for whatever value the first throw 
has there are six different throws of the second die which can be associated 
with it: and siuce the first can also have six values there are 6 x 6 or 
36 combinations possible. Only one of these combinations consists of 
two aces, so that the chance of throwing two aces is ^ or (\)^. Generally, 
if the chance of an event is p then the chance of it occurring twice in 
succession is f^. For suppose there are m ways in which it can happen, 
and n ways in which it can fail, then f = ml(m + n). Also, having 
happened once, there are m ways in which the second success can occur, 
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each of which ways might be associated with the first success; and since 
there are also m ways of this first success happening there are in all 
m? -wajB of the double success happening. Similarly there are in all 
(m + nY ways of the double event either happening or failing, and 
therefore the chance of a double success is 

m' _ 2 
(m + n)^ ^ ' 

Reasoning exactly similar to the above will enable the reader to 
convince himself of the truth of the following general theorem : 

If there are a number of independent events a, b, c, ... etc. and their 
respective probabihties of occurrence are pa, Pb^ Pc> ••• etc., then the 
probability of all occurring is the product 

PaXPbXPoX —■ 

This may be said to be the fundamental proposition in probabiUty. Its 
use will be recognised best by an example. It will be instructive to take 
a real psychometric experiment, that already described in the experi- 
ment on weight-lifting on p. 13. The subject of those experiments com- 
pared each of the weights with the standard 450 times. The results of 
this were as follows* : 



Gramfl 


84 


88 


92 


96 


100 


104 


108 


No. ol answers heavier 





11 


32 


100 


212 


402 


431 



From these the probability of the subject answering "heavier" at any 

weight can be calculated, always assuming that the conditions have 

throughout remained the same, by dividing the above numbers by 450. 

We thus obtain the following: 

Grams 84 88 92 96 100 104 108 

Probability of 



L"] »•* 



, , . „ , 0-0000 0-0244 0-0711 0-2222 0-4711 0-8933 0-9578 
answer "heavier J 

Let US now apply our theorem on the combination of probabilities 
to solve the following problem: 

Let the weights 84, 88 grams etc. be each presented once to the subject. 
What is the probability that the lowest answer "heavier" will be at 96 grams? 

The probabihty that he will answer "heavier" at 96 grams is 0-2222. 
The probability that he will not answer "heavier" at 92 grams is 
1 _ 0-0711 = 0-9289. Similarly the probability that he will not answer 
"heavier" at 88 grams is 0-9756, and at 84 grams is 1-0000. The proba- 

* Urban, op. cit. p. 287, Table XI (multiply the values by 450). Or alternatively 
consult Table V, p. 175, of The Application of Statistical Methods to the Problems of 
Paychophysics, by F. M. Urban, Philadelphia, 1908. 
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bility of the combination happening, namely, an answer "heavier" at 
96 and none below, is, therefore 

1-0000 X 0-9756 x 0-9289 x 0-2222 = 0-2014. 
The actual frequency with which this occurred was 0-2125. 

(5) IMPORTANCE OF THE BINOMIAL EXPANSION IN THE 
THEORY OF PROBABILITY 

By a not unnatural hypothesis, which has been widely accepted and 
has proved very fruitful, an error of measurement, such as that made 
in judging the half-way point in a line in an experiment above, may be 
looked upon as the resultant of a large number of small circumstances, 
each of which sometimes sways our measurement in the one direction, 
sometimes in the other. This hypothesis, combined with the fundamental 
theorem of probability just explained, leads to the use of the binomial 
expansion in describing distributions of error. This can be illustrated 
by the following example: — Let us suppose that a quantity which we 
desire to measure has really the value 13f units, but that we are opposed 
in our efforts to measure it by seven "Djinns," each of whom has the 
power of displacing our measurement by one half-unit. Let us further 
imagine that each of these mischievous imps, in an endeavour to prevent 
our making any steady measurement, decides that he will add or deduct 
his half-unit according to the throw, heads or tails, of a coin. Whenever 
we try to make a measurement, therefore, these invisible seven will 
assemble and throw each his coin in the air. If all the coins happen to 
come heads, seven half-imits are cunningly added to the 13J, and we 
obtain a measurement of 17. If all come tails, we get 13| minus 3|, 
or 10. If on another occasion five are heads and two are tails, five half- 
units will be added and two subtracted, giving 

13| + I - I = 15. 
An actual test of this is given in the following figures and diagram. 
Seven coins were thrown on 128 occasions, and each time the proper 
number of half-units was added (for the heads) or subtracted (for the 
tails) from 13 J. The result was as follows: 

heads and 7 tails ooourred 2 times giving a value 10 units 



1 „ 


„ 6 „ 




8 


. ,. .. 


, 11 


2 „ 


„ 5 „ 




16 


t )» >> 


. 12 


3 „ 


„ 4 „ 




38 




13 


4 » 


„ 3 „ 




43 


> » >r 


, 14 


5 „ 


„ 2 „ 




16 


* ft ty 


, 15 


6 „ 


„ 1 „ 




2 




, 16 


7 „ 


„ „ 




3 


, „ „ 


. 17 






Total 128 






This distribution 


is shown 


in Fig 


.5. 
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The general resemblance between the diagram, made by throwing 
coins, and previous diagrams which represent the result of psychological 
experiments is not surprising if we consider for a moment what the 
condition of such experiments are. The "Djinns" which oppose our 
efforts to obtain a true value for say the spatial threshold are in- 
numerable. Some are in the fingers of the experimenter, and make him 
press irregularly on the aesthesiometer points. Others cause noises to 
happen in the neighbourhood to distract the subject's attention. Other 
Djinns make the instrument hot one day and cold the next, others live 
in the subject's skin, and quite a lot are engaged in stirring up vivid 
imaginations in his mind so that he feels all kinds of prickles and 
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Kg. 5. Number of heads in a throw of seven coins in 128 repetitions 



tinglings which make his judgments on the position of the points as 
erratic at times as is the throw of a coin. 

In the figure, there is shown, in addition to the polygon based on 
experiment, a dotted theoretical polygon to which we now turn our 
attention. Since the probabihty of obtaining a head at one throw is J, 
the probabihty of obtaining seven heads in a throw of seven coins is 

(i)' or Tk- 

The probability that a certain coin will give a tail but all the others 
heads, is (J)^ x (1 — |) which also equals y^. As there are seven coins 
in all, each of which might give the only tail, the total chance of 
obtaining six heads and one tail is ■^. The probabihty that two 
specified coins out of the seven will give tails, the other five giving heads, 
is (1)5 X (1 — 1)^ = Y^. The number of ways in which the two can be 
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specified is 21. The total chance of obtaining five heads and two tails is, 
therefore, 21/128, obtained thus, 

7! 



/ly /1\B ^ ^ 
U/ \2/ ~128" 



5!2! 

It will be seen that the above probabihties of obtaining seven, six or five 
heads are the first three terms in the expansion of (| + |)'. The next 
term will similarly be found to be the probability of obtaining four 
heads, and so on. We thus obtain the following results : 

Probability of 7 heads 1 in 128 

>» »» o f» ' j» 

» .. 5 ». 21 „ 

»» » 4 tt ^^ « 

»> »» 3 „ 35 „ 

„ 2 „ 21 „ 

„ „ 1 head 7 „ 

„ „ no heads 1 „ 

128 

It is these numbers which are shown by the dotted line in the figure. 

In general, if p is the probability of an event succeeding, and q of 
it not succeeding, the respective chances of it succeeding 

k,J!i-l,k-2, ...3,2, 1 or 

times in k trials are given by the terms of the expansion of (p + y)*. 

In n groups of k trials therefore the most probable numbers of 
successes in the various groups are 

w 1^ + kp^-^q + ^ ^^ ~ -^^ p^-^q^ + ... + ?*}. 

For example let the event be throwing either an ace or a six with a 
six-faced die, let six trials be made in each group, and let a thousand 
groups be tried. Then the above expansion is 

Therefore the number of times a throw consisting of only "aces" or 

"sixes" will occur in 1000 trials with six dice each time is most probably 

1000 1000 . X , ^T, i • i T .1. 

-sj- = -=Kq- or once, to take the nearest mteger. In the same way we 

get this table: 
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1000 throws of six dice each time 
All 6 are "Aces or Sixes" on -=5^- or approx. 1 occasion 
Only 5 „ „ „ „ 

4 

o 

5> "^ »» »> »» J» 

J» ^ >l »» >» J» 

„ 1 is an ace or six 
None are aces or sixes 



12000 

729 

60000 

729 

160000 

729 
240000 

729 

192000 

"729 

64000 

729 



16 occasions 

82 

220 

329 

264 

88 
1000 



(Mode) 



Here the mode is at two. 

What is the msan'i In calculating it we shall take the exact fractions 
in the above table, not the approximate integers. The mean is then 
obtained thus: 

6x 1000 -=-729= 6000 H- 729 

5x 12000 -=-729= 60000 H- 729 

4x 60000 H- 729 =240000 -=-729 

3 X 160000 -=- 729 = 480000 -=- 729 

2 X 240000^729=480000 -=-729 

1 X 1920004-729 = 192000^729 

Ox 64000-=- 729=nU 

Here, therefore, the mean coincides with the made. 
In general we have from the binomial expansion 



„ 1458000 .... 

— 729~^ 

T>T 2000 „ 

*'«'^" = 1000=2 



ph + kj)^-iq + 



k(k-l) 



1.2 



Mean equals 



p^-''q''+ ... 



k(k-l) 



Jcpk +{k-l) kp^-^q -K^ - 2) ^ 2 P^'^f + ••• 

= kp |j>«=-i +{k-l) p^-^q + ^^ ~ ^^ ^^ ~ ^^ p«=-3g2 + , 

= kp {p + q)^-^ = kp since p + q=l. 
The made can also be deduced from the expansion. The first term in 
the expansion, p, corresponds to k successes, the second term to ^ — 1 
successes, and so on, so that the term corresponding to any number m 



of successes is 



kl 



m ! (k — m)\ 



/nTn nk—m 
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This term is made from the preceding term by multiplying the latter by 

m+ 1 q 

k — m p 

and the succeeding term is made from it by multiplying by 

m q 

k— m+ \ p' 

It will therefore be the largest term if 

m + 1 ff , m q 

• — > 1 > • — 

k — m p k — m+ 1 p' 

whence qm + q> pk — pm, pk — pm + p> qm, 

qm + pm >kp — q, kp + p> qm + pm, 

m > kp — q, kp + p> m, 

so that kp ~ q < m < kp + p. 

The greatest term, therefore, is that which corresponds to a number 
of successes between kp — q and kp + p. The range thus indicated is 
unity since p + q = 1. In the binomial expansion therefore the mean 
and the mode agree to an integer. 

Standard Deviation of the Binomial Expansion 

For the binomial expansion the standard deviation is equal toy{kpq). 
This can be proved as follows : 

In the expansion of {p + q)^ the first term p" represents the proba- 
bility of the specified event succeeding k times. The mean number of 
times it succeeds is kp, so that the deviation is k — kp and the first 
term in o^ is ^3* x {k — kp)^. Similarly the second term Ap*-^ q of the 
binomial represents the probabiUty oi (k — 1) successes, and the devia- 
tion is therefore (Jl; — 1) — kp, so that the second term in o^ is 

Remembering that k — kp = kq we get the following expansion for a^ 
o^ = {kqf '^ -f {kq - 1)^ k'^-'^ i + {kq- 2f ^ ^^ ~ ^^ p^-^ q^ + ... 

= kY (p + qf - 2ky (p + q)"-^ + lip*-! q + i ^ ^^ ~ ^^ p^-^q^ + . 

= kY - 2^?^ + kq {p*-i + 2{k-l) p*-^ q+ ...} 

- kY + kq {{p + j)*-i +{k-l)q{p + q)"-'} 
-kY+kq{l + {k-l)q} 

= - ky +kq(l + kq-q) 

= kq (1 - q) = kpq, a = V(^Pq)- 
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(6) THE NORMAL CURVE OP ERROR 
We have seen that a binomial expansion gives a cocked hat figure 
very like the actual diagrams obtained by experiment. In the imaginary 
example which we considered, seven Djinns added or subtracted each 
a half unit to or from the quantity 13| which we were trying to measure. 
We then obtained measurements extending from 10 to 17 units, but 
always at the exact units. 

In practice if we were handling a not easily measurable quantity 
we might well find our measurements range over this distance, but 
unless they were constrained to do so by some pecuharity of the method, 
they would not always occur at exact units, but at any distance. This 
case is covered if we imagine the number of Djinns (the factors of 
accidental error) to be much increased but the influence of each one 
made less : thus there might be 7000 Djinns each adding or subtracting 
a mere fraction of a unit, or ultimately an infinite number of them, each 
adding or subtracting an infinitesimal amount, just as an infinite number 
of the tiniest errors (we may well imagine) account for the variations 
in our experimental readings. 

What would the binomial expansion then become? That is, what 
form does the expansion of (^ + |)* take when k becomes infinite and 
the terms are not unit distance apart but only an infinitesimal distance 
dxl The whole range covered by the expansion is kdx, and the term at 
either end which occurs when all the errors are either positive or negative, 

is at a distance k -^, from the middle, so that each elementary error is 

now dxl2 just as formerly it was half a unit. 

The term which corresponds to I errors being positive and k — I 

negative IS P = ( i)J (i)*-' A; !/{Z ! (jfc - Z) !} (1), 

and the net error, the abscissa of this point, is 

x = {l-(k-l)}dxl2 (2). 

The next point, distant dx, is that corresponding to Z + 1 positive and 
k ~l —1 negative errors, giving an abscissa of 
{l+l-{k.-l-l)} dxl% 
Its probability is 

P + dP = i^rkm +l)\{k-l-l) !}. 
The ratio of these is 

P + dP _k-l dP , dP k-2l-l 

p ~z + i" + P' ■■ P ~ l + l ■ 

B. & T. 3 
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But ' = T- + 9 » i^om eqn. (2) for x above, so that 

dP 2 (2a; + dx) 

~P~~ 2x + {k -{■ 1) dx' 
Now we are going to make k, the number of atomic errors, equal to 
infinity, and we can therefore neglect the 2 in A + 2 and write 
dP 2{2x + dx) ix 2dx 



P 2x + kdx 2x+kdx 2x + kdx 



■(3). 



Therefore j-, which is the quantity we require before we can integrate 

and obtain the equation to the continuous curve to replace the binomial 
cocked hat, is given by 

_1^=_^? + _J_ (4) 

P dx 2xdx + hdx^ 2x + kdx ^ '' 

We must now consider the quantity kdx which gives the entire 
extent of scatter, the whole range. The number of errors k is infinite, 
we have assumed, and dx is infinitesimal. The range kdx may then 
either be finite or infinite. If we assume it to be finite, then kdx^ will 
be infinitesimal and the above equation becomes 

_ 1 ^ - i^ 4. 2 _ • fi .. 

P dx 2xdx '^ 2x + kdx'' ^^^^^^ 

when dx becomes infinitesimal. Or ^ = — infinity, except when P = 0. 

In this case the probability falls off infinitely quickly from the mean, 

and this is therefore a case of no scatter at all. If therefore we postulate 

an infinite number of elementary errors, we must allow a possible range 

of scatter of infinite extent, the only alternative being no scatter at all. 

But although the possible range is infinite, it will be found that at 

infinity the probability is infinitesimal, that is the larger errors do not 

in practice occur. 

We take therefore , , 

kdx = 00 , 

. k (dx)^ = a finite quantity*, 

which we shall write = ia^. 

(a will turn out presently to be our previous acquaintance the stan- 
dard deviation.) We then have from eqn. (4) 

IdP ix X 

P dx 2xdx + 4:a^~o^ ^^'' 

* Here again the alternative ought to be Investigated. 
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and on integrating, 

log P = — n-^ + constant, 

or P = Ce-'=V(2^') (6). 

This equation gives the probability P of any value of the error x 
occurring, and not only of any integral value, as the binomial does. 
The value of the constant C will be found presently, and it will also 
be shown that a is, as has been already asserted, the same standard 
deviation which we abeady know in another guise. The general shape 
of such Normal Curves is shown by the example in Fig. 6, p. 43. 

Some properties of the Normal Curve 

In the curve P = Ce'" '^ \ P is the probability of the occurrence 
of an error x. Let us consider of what order of magnitude the quantities 
P are. In the first place we remember that the curve was found as the 
limit of the expansion (^ + |)*, when k was made infinite. This curve 
however flattens out more and more as k is increased. 

For example 

So that clearly when k becomes infinite, the ordinates of the cxirve, as 
we have so far considered it, flatten out so that they are all infinitesimal. 
In other words the probability P of any exact value of x occurring is 
really infinitesimal though it varies from one x to another x. G must, 
therefore, be infinitesimal. Let us try putting 

C = C'dx, 

so that if C is of the same order of smallness as dx, the new quantity C 
will be a finite constant. We have then 

P=C'e-»''/(2<r»)^a. (7)_ 

This means, in geometrical language, that instead of using a curve whose 
ordinate P measures the probability of the occurrence of x, we had 
better use a curve where this probability is measured by the area of an 
elemental rectangle contained by two ordinates enclosing x, and distant 
dx from one another (cf. Fig. 4, p. 17). Such a curve will be similar in 
shape to the former but oi finite ordinates. 

We have assumed in the above argument that C as there defined 

3—2 
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is a finite quantity, that is that G is of the same degree of smalhiess 
as dx. That this is so may be shown by proceeding next to find the 
actual value of C in the foUoAving manner. 

It is clear that the sum of the probabilities of all errors must equal 
unity, for at any one measurement it is certain that some error or other 
must occur, if we include zero error in the mathematical sense. That is 

00 

SP=1. 

— 00 

Substitute for P from the equation (7) above and replace the sign of 
summation by integration. We thus obtain 



C're-^V(2<r»)da!=l (8). 

J —00 



-f. 



Write a;7(2ff2) = z^, xdx = 2<jHdz; 

then the integral becomes 

e-^'^^^= C'aV2.25* = C'ay/(^) (9). 

Whence, since this equals unity, we have 

C = 1/(^^2 -v/tt), a finite quantity (10). 

So that we can now write 

^=,-^)^-"""' <"'■ 

The ordinate of the curve defined on p. 35 is y = P/da; so that its 
equation is -i „ » 

^-^im'-^"^ "^)- 

Such a curve is called a probability curve. The probability of the 
occurrence of the value x is given by y^^x and the probability of x falling 

between a and 6 is given by ydx. The total area I ydx equals 

unity. 

If N measurements were made, and the errors were distributed 
according to such a curve, then the most probable number of times a 
deviation x occurred would be NPa,. A curve 

^ C7V(27r) 
is similar to a probability curve, but each ordinate is N times as tall. 

* The reference is to Integral B in Appendix II giving the values of a number of 
integrals of general use in the theory of probability. 
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It is a distribution curve and its total area is not unity but N, and the 
integral from a to 6 represents not the probability of x falling between 
a and b, but the most probable number of times it would fall in this 
range out of N trials. 

Since this curve is symmetrical, the mean, mode and median coincide. 

Let us find the standard deviation of such a set of measurements. 

The sum of the squares of the deviations is given by 

(write aj2 = 2a^z^, xdx = 2cihdz) 

CT-\/(2n)J -00 zay/2 ^J-rt 

The mean square deviation is Na^jN = a^. 

The root mean square deviation, or standard deviation, is therefore a, 
so that we were justified in using this letter on p. 34 when we wrote 

k {dxf = 4ff2. 
The quantity a can be shown to have another significance in the normal 
curve, namely it is the distance from the centre to the point of inflection 
or point where the curve changes from convex to concave. The proof 
of this statement is as follows. At a point of inflection of a curve, the 
value of d^yjdx^ is zero. In our case 

If this is to equal zero then a? must equal o^. 

Finally, if we suppose the area enclosed between a distribution curve 
and the axis of x to be spinning round the axis of y, then the moment 
of inertia is such that we can consider the whole weight concentrated 
equally at the two points of inflection. In other words, a is the "radius 
of gyration." For the area (or weight) of each vertical elementary 
column of the curve is ydx and its distance from the axis of gyration is x. 

The moment of inertia is therefore 

* See Integral C, Appendix II, p. 202. 



38 PSYCHOPHYSICS [pt.i 

Write a;* = 2o-V and the integral becomes 

Therefore the N can be considered as concentrated at a distance a from 

N 
the centre, or -^ at each point of inflection. 



The Relation between Mean Variation and Standard Deviation, 
in the case of Normal Distribution 

The distribution is theoretically symmetrical and the mean of the 
positive variation ought to equal the mean of the negative variation, 
apart from sign. We have therefore 

Six) 
M.v. = ^^. 

S indicating sum from to oo of the variations x. Remembering that 
the frequency of occurrence of each x is 

we replace the sum S by an integral, obtaining 



N r 

ctV(2w) Jo 



dx. 



rV(277)Jo NI2 
Writing x = -y/^o^f and using integral A of our list of definite integrals 
(indeed this expression integrates) we obtain 

^2 o if" t -f^j4 2V2 1 V2 
M.V. = -^ 2a^ |e ^ d^= —7- o- x 5 = -V «^- 

\/2 
The numerical value of ^ is -8 approx. 

The Relation between Probable Error and Standard Deviation, 
in the Case of Normal Distribution 

The probable error was defined on p. 22 as an arbitrary reduction 
of the standard deviation, viz. 

p.E. = -674:50. 

In the case of Normal Distribution a physical meaning can be given to 

* See Integral in Appendix II, p. 202. 
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this quantity. Consider the number of cases which fall within the limits 
of the range ±-6745ff. This number is 

aV(27r)Jo 

Values of the probability integral have been calculated and tabulated 

for all limits, and if such a table be consulted* it will be found that this 

N 
quantity has the value -^. That is to say, the probable error gives the 

range within which one-half the cases may be expected to fall. It is 
more instructive to remember that half the cases may be expected to 
fall outside ± p.b. 

With skew distributions this meaning ceases to hold, and for these 
a should be used. Indeed it is in general a better quantity to employ. 

(7) ON FITTING A NORMAL CURVE TO DISTRIBUTION DATA 

Consider the experiment of bisecting a line of which the data are 
given on p. 15. The histogram of Fig. 3, p. 16 represents these data, 
and shows the density with which these points occur in each part 
of the range. We wish to replace this stepwise figure by a smooth 
Normal Curve which will give us at each point of the range the theoretical 
proportionate density of the bisection points at that spot, and we want 
this Normal Curve to be the most probable which can be based on the 
given data. 

The equation of the required curve is 

v=— e-<'"-«)'/(2''') 

^ crV(2w) 

where N is the number of experiments made, x is measured in mms., 

and {x — a) is a quantity measured from some central point of the data. 

In the theoretical curve (which is symmetrical) a is the point where 

mean, mode and median coincide. In the data however the mean is 

60-13 and the median 60 while the mode is unknown. What value shall 

we give to a so as to obtain the best fitting curve? 

To answer these questions it is necessary to consider what we mean 
by best fitting curve. 

The meaning of "best fitting curve" is really the same as the 

meaning attached to the phrase "most probable theory." By the best 

theory of any set of data we mean the theory from which the observed 

data could have chanced to spring with a greater probabihty than would 

* E.g. the first table in Pearson's Tables for Statisticians and Biometncians. 
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be the case with any other theory. Take a simple example. Suppose 

a bag is known to contain a large number of equal-sized balls and nothing 

else: but the colours of the balls are entirely unknown. Experiments 

have been carried out to learn something about their colours, each 

experiment taking the form of extracting one ball, noting its colour, 

and replacing it. Ten such experiments have been made and the results 

are as follows : 

5 black balls, 
3 white balls, 
1 red ball, 
1 green ball. 

What is the best theory of the composition of the bag? 

The best theory on the facts as given, is that the bag contains 
^ths black balls, ^ths white balls, J^th red balls, and ^l^th green balls. 

Suppose however that another theory was advanced, namely that 
the bag contained ^ths black balls, ^%ths white balls, and ^th each of 
red, green, yellow and blue balls. How should these two theories be 
compared? 

The proper plan is to find the probabiUty, on each theory, that 
ten dips would result in what was actually observed. Then that theory 
is best for which this probability is greatest. Let us take first the theory 
which we assert to be the best. The probabihty, on that theory, of 
drawing five black, three white, one red and one green ball (the order 
being immaterial) is 

10! 



Uoj uoj "lo'io" 



= 0-042525. 



5!3!1!1! 
The corresponding probability on the other theory is 

(12) (12) ■l2"l2"5!3!l!l! = ^"^^^^^^- 
The former theory is therefore the better of the two. If from other 
sources we knew however that the bag did contain balls of six colours, 
then the first theory would be ruled out, and the second theory would 
have to be compared with other six-colour theories. The reader might, 
for example, compare it with the theory that the bag contains -^ths 
black, ^ths white, ^ths red and -ji^th each of green, blue and yellow 
balls. 

Turn now to our actual experiments on bisecting a line. The 29 
experiments are like 29 dips into a bag, resulting in the numbers on 
p. 15 being drawn. From our general consideration of the problem 
we beUeve that the numbers in the bag form a continuum, that is we 
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think that another bisection mark might fall anywhere within the range, 
and not only at points already struck, though of course our measurements 
are only being made to the nearest tenth-millimetre. Secondly we think 
that a curve of the form 

will express this continuum. The problem is, which curve of this form 
is the best fitting one. This is decided in exactly the same way as was 
the above example of the coloured balls. For each curve find the 
probability that the actually observed distribution may have arisen 
from it by random sampling. Then that curve is the best for which this 
probability is greatest. 

The actual process of trial and error giving now this now that value 
to a and a would take too Icfng, and we find the optimum values by the 
usual process of the differential calculus. Let us do so in a quite general 
manner, taking n values (instead of 29 as here). 

Given a distribution following the law 

«= \ e-(3:-a)V(2a») 

^ a-v/(27r) 

what is the probability Q that the n special values 

*1> ^2) ■'^S' -"iJ ■•• •''>^' ••• ^n 

will be obtained in a set of n trials (the order being immaterial)? The 
probability of x>, is 7 

aV(277) 

and the required probabiUty is equal to the continued product of a 
number of such expressions, in which A takes the values 1 to n succes- 
sively, multiplied by n ! because the order is immaterial. 
We therefore have 

This probability we wish to make a maximum by choosing the best- 
values of a and a. We must therefore put 

dQjda = 

and dQjda = 0. 

Now f = n\ f^y e-^(«.-»)V(2^») x ^i^, 
da \a V 277/ <^ 
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and since this equals zero we must have 

S {x^ -a) = 0, 
:. S (x^) = S(a) = na, 
a = S {X))ln. 
That is, a must be the mean of the observations. Turning now to the 
second equation we have (omitting from the first the factor dxj-\/(2n), 
which is independent of a) 



^(1 
da [a" 
Therefore 



a(x^-a)W') 






S {x^ - a)'' 



— na^> 



0. 



nafi — S (xx — a)^, 
0^ = 8 (ajx — a)^jn, 
i.e. a is the standard deviation of the readings. 

We find therefore that to obtain the best fitting curve we must 
find a the value of the mean of the observations, and a the value of the 
standard deviation. In the case in question 

a = 60-13, 
ff= 1-38, 
29 



so that 



y 



- (a- 60-13)2/(2 xl-aSi*) 



1-38 ^(2^) 

where a; is in mms. This curve is drawn on the adjoining figure. Calcula- 
tions, such as those required to find a number of ordinates of the above 
curve for the purpose of drawing it, are best performed in tabular 
fashion. For example, the present calculation might be arranged as 
follows, and the reader should calculate one or two ordinates for practice 
by this means. 



(a) 


(6) 


(c) 


id) 


(«) 


(/) 


X in mms. 


a; -60-13 


2 X 1-38' 


Column c 
X loge 


Becipiocal of 
Antilog. column d 


29 
Column ex^.3g^^2.^ =3, 















This arrangement is suitable for an approximate calculation using 
a ten inch slide rule, from which the logarithms are also taken. The 



CH.ii] THE ELEMENTARY THEORY OF PEOBABILITY 43 

accuracy thus attained is quite as great as the extent of the experiment 
justifies. If logarithm tables, or calculating machines, were to be used, 
somewhat different tabular arrangements would be required. 

Some of the calculation in the above table has, however, once and 
for all been done and printed in tables of the probability curve. The 
best of these for our purpose is Sheppard's table, printed as Table II 
n Professor Pearson's Tables for Statisticians and Biometricians. 




67 



62 



63 



64 



60 61 

Millimetres 

Fig. 6. A normal curve fitted to the bisection data. 
The histogram is also shown 

a Normal Curve to the Bisection Data with the 
Aid of Sheffard's Tables* 



I 


11 


III 


IV 


V 


X in mms. 


a;'=a; -60-13 


Sheppard's x 
a;71-38 


Sheppard's z 


!/=29z/1^38 


60-13 

60 

59-5 

59 

58-5 

58 

57 

66 




0-13 

0-63 

1-13 

1-63 

2-13 

3-13 

4-13 




009 

0-46 

0-82 

1-18 

1-54 

2-27 

2-99 


-3989432 

-397 

-359 

•286 

•199 

•122 

■030 

■005 


838 
8^34 
7-54 
6-99 
4-18 
2-56 
0-63 
0-11 



The other half of the cvirve can be drawn by symmetry. No interpolation is here used in 
the tables. A sUde rule, or Crelle's Calculating Tables, can be used for the multiplications, 

"■ Of course an experiment of only 29 observations does not justify any curve fitting 
at all, as the accuracy is not sufficient, the sample not being large enough. But a short 
example is necessary for explanatory purposes in a text-book. 
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We are at present only concerned with Sheppard's first and fifth 
columns headed x and z respectively. These are connected by the 
relationship ^ 

V(27r) 

that is they give a curve with N =\ and a=\. The x of Sheppard's 
table is, therefore, obtained from our x by division by a, and his z needs 
to be multiplied by Nja. Our calculation then takes the form shown 
above Fig. 6. 

It must be clearly understood that there are two parts in any curve 
fitting problem. Firstly there is the decision as to what kind of curve 
is to be used, and secondly the finding of the best fitting curve of this 
kind. If it were merely a question of getting a curve to fit the particular 
data of one problem, then a curve could always be drawn to fulfil the 
conditions exactly. It is only because general considerations dictate that 
the curve shall be of a certain kind, that an exact fit cannot be obtained 
and the best fit has to be found. 

(8) THE METHOD OF LEAST SQUARES 

The principles adopted in the last section are those which underlie 
the Method of Least Squares, which is employed to find the best 
solutions of a set of Unear equations which are more numerous than 
the unknowns, and slightly inconsistent with one another. To illustrate 
the principle take three equations for two unknowns, 

ax +by + c =0, 
a'x + b'y +c' = 0, 
a"x + b"y + c" = 0. 

The quantities a, a', a", b, etc. having been measured, the equations 
are found not to come to zero for any values of x and y, but to give 
"residuals" v, v', and v" thus: 

ax + by + c =v, 
a'x + b'y + c' = v', 
a"x + b"y + c" = v". 

Now the presence of these residuals v may be assumed to be due to 
numerous small errors in the coefficients a, b, c, and the distribution of 
any v, .were one of these equation-observations to be repeated many 
times, will, it may be assumed, be a Normal Curve. The probability 
of occurrence of v will therefore contain as chief factor e-*' and the 
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probability of occurrence of v, v' and v" (presuming them independent) 
will contain the factor 

e-"" X e-'^' X e-'^", or e-^C^), 
where S (v^) = v^ + v'^ + v"^. 

To make this probability a maximum we must make S (v^) a minimum, 
whence the name Least Squares. The conditions for S (v^) a minimum 
are 

dS/dx = dS/dy = 0, 

and also (though in practice it is not necessary to find them) the second 
differentials must be negative. We get at once 

^ iXax + by + cf + {a'x + h'y + c'f + (,a"x + h"y + c"f\ 
= 2 (a2 + ffl'2 + ffl"2) a; + 2 (a6 + a'h' + a"h") y + 2(ac+ a'c' + a"c") = 0, 

and similarly for y. 

The two equations thus reached are called Normal Equations. There 
will of course always be as many of them as there are unknowns. If the 
original equations were not of equal reliability of weight they must of 
course first be multiplied by their weights. The rule of Least Squares 
is therefore: To obtain the Normal Equation for any unknown x, multiply 
each equation by its weight w, and by the coefficient ofxin that equation, and 
then add all the equations together. The Normal equations in our example 
are thus: 

S (ahv) x + S (abw) y + S {acw) = 0, 
S (abto) x + S (b^w) y + S (bcw) = 0, 
whence unique values of x and y can be found. 



CHAPTER III 

THE PSYCHOPHYSICAL METHODS 

Experimental methods and mathematical processes — ^The method of limits — ^The 
method of average error — ^The constant method — Difference thresholds and the 
probability of a judgment of a certain category. 

(1) EXPERIMENTAL METHODS AND MATHEMATICAL PROCESSES 

The experimental determination of absolute and difference thresholds 
or limina is complicated and difficult. A considerable number of physical 
and psychological, and, it may be added, mathematical, factors is 
involved, of varying relative importance in different cases. The result 
is that different methods of procedure have been found most suitable 
for different cases. These methods have been traditionally grouped 
under three (or four) distinct headings, and called the Psychophysical 
Methods. They are: 

(1) the Method of Limits (Method of Minimal Changes), 

(2) the Method of Average Error (Method of Production), 

(3) the Constant Method (Method of Eight and Wrong Cases). 
A fourth method is generally added to the list, viz.: 

(4) the Method of Equal Appearing Intervals, or Method of Mean 
Gradations, but this is really no new method. It owes its special name 
to the nature of the task which it fulfils, viz. the determination of equal- 
appearing (uehermerklich) sense-distances as distinguished from just 
perceptible (ebenmerklich) sense-distances. The method which it employs 
falls under one or other of the first three headings. 

There are two things, essentially different from each other, which 
are commonly confused under this one heading "psychophysical 
methods," namely the methods of experimenting in order to obtain 
data, and the jyrocesses of calculation after the data have been collected. 
To avoid this confusion the words "method" and "process" will be 
employed throughout this book in the way indicated by their use in 
the above sentence. It is urged that their general adoption would be 
advantageous. The cause of the confusion is to be found in the historical 
development of the subject, for with each of the methods of experi- 
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menting a process of calculation was associated and the one name was 
given to both. 

The experimental methods of determining thresholds may be 
divided into two main groups: 

1. Methods in which the stimulus is altered continuously until in the 
opinion of the subject it fulfils some given condition. 

2. Methods in which various values of the stimulus are separately 
submitted by the experimenter to the subject who expresses a 
judgment on each of them, classifying them into two or more 
categories. 

The methods belonging to the second group may in turn be classified 
according to the order in which the stimuli are submitted to the subject. 
The second group is thus subdivided as follows : 

2 (a). Methods in which the order of succession of the stimuli is 
irregular, non-consecutive. 

2 (6). Methods in which the order of succession of the stimuli is 
consecutively (i) ascending or (ii) descending. 

Under 2 (a) comes the Method of Right and Wron^ Cases. Under 
2 (6) come the Method of Minimal Changes and the Method of Serial 
Groups*. The latter is a special case of the Method of Minimal Changes, 
in which each value of the stimulus is submitted a number of times 
before the next consecutive value is submitted. A corresponding method 
under 2 (a) is the Method of Non-Consecutive Groups, in which groups 
are taken in an irregular order. It is customary in the Method of Serial 
Groups to introduce among the stimuli an equal number of " catch " cases, 
in which no stimulus (or stimulus difference) at all is given. This might 
also be done in other methods. Bach method can be further subdivided 
according as the subject is or is not warned beforehand of the kind of 
succession to expect. 

The differences which result from the use of these various "methods" 
are clearly due to their different psychological effects upon the subject. 
But we have also to take into consideration the differences in the 
"processes" of calculation. These, of course, are purely mathematical 
and ignore the subject altogether. 

* We adopt here the convenient name suggested by 6. M. Stratton who first described 
the method {Psychol. Rev. 1902, ix. pp. 444 — 447). It had, however, been independently 
discovered and employed by W. MoDougall four years previously during hia stay in 
Murray Island (Rep. Cambridge Anthrop. Expedition to Torres Straits, Cambridge, 1903, 
n. pp. 190—193). 
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(2) THE METHOD OF LIMITS 

In using this method for the determination of difference limina the 
following mode of procedure is adopted. The variable stimulus F is 
first made equal to or slightly larger than the standard S, and then 
increased step by step by small increments until the subject finds it 
just perceptibly greater than S. V is increased still more, and then 
gradually diminished until it just ceases to appear greater than S. The 
mean of the two values of V — S thus obtained is the upper difference 
limen, T„. 

Four values, instead of two, might also be obtained, viz. for (i) F just 
not perceptibly greater than S, (ii) F just perceptibly greater than S, 
(iii) V just perceptibly greater than 8, (iv) F just not perceptibly greater 
than S; (i) and (ii) being for ascending values of F, (iii) and (iv) for 
descending. T^ vnll be the mean of these four values of F — S. 

The lower difference limen, Tj, is obtained in a similar way. Both 
limina may be obtained in the same series of experiments by the 
"method of complete descents and ascents." 

A series of determinations of each limen is made and the average 
taken. A measure of scatter (mean variation, say, or standard deviation) 
is also calculated. Absolute thresholds may also be found by this method. 

The number and size of the increments employed must be adjusted 
to the particular conditions of the experiment. The subject of the 
experiment should be given a certain amount of preliminary practice 
before being started upon the work, and introspective reports should be 
asked of him. 

Among the various possible sources of error which deflect the 
subject's judgment both in this and in the other psychophysical methods, 
two are of special importance. They arise from the temporal and spatial 
arrangement of the compared stimuli, and are called the "time error" 
and "space error" respectively. Thus, in a determination of the 
difference limen for sound-intensities, the two stimuli, S and F, cannot 
be presented simultaneously. One must precede the other, and in this 
way it may produce a slight degree of fatigue which causes an over- 
estimation of the other, or it may produce the reverse effect of sharpening 
the attention to the second. Thus a time error arises. Again, in experi- 
ments with brightness-intensities or visual extents, where the stimuli 
can be presented simultaneously, a space error arises from the fact that 
the one stimulus must be presented either to the right or to the left of 
the other stimulus, and the subject's judgment varies accordingly. In 
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experiments with lifted weights both sources of error may be involved. 
These so-called constant errors may be approximately neutralised by 
arranging that in the course of the experiment the standard shall precede 
the variable or stand to the right of the variable in half the cases and 
follow or stand to the left in the other haK — the time or space order, 
or both, being of course changed quite at random in the successive 
limen determinations. A better plan is to evaluate the limen, and its 
scatter, for each time and space order separately. This gives us the 
values of the time error and the space error, which are of interest for 
their own sakes. Fechner's theory of these errors and their measurement 
is based upon the assumption that the time and space orders of the two 
stimuli, S and F, exert an influence upon the result which is equivalent 
to a definite increase or diminution in the stimulus-value of one or the 
other, and thus increase or diminish the value of /S ~ F by the amount 
of the time error e^, or that of the space error eg, or by the sum or the 
difference of these two. The time error is positive or negative according 
as the effect of the time order is to increase or diminish the apparent 
value of the Jkst-presented stimulus. The space error is positive or 
negative according as the effect of the space order is to increase or 
diminish the apparent value of the left-hand stimulus. 

Four principal cases of time and space order are possible, and are 
conventionally numbered as follows*: 

I Standard presented first and to the right, 
II „ „ second 

III „ „ first „ left, 

IV „ „ second 

Employing these numbers as sufi&xes, we have the equations 



-t Wj — J- u ^1 ~r ^2 ' 


Tjj =T, + 61-62' 


Tu,,=T„ + ei + e,' 


^Jii = ^1-61-62' 


Tu,,,-T„-e^-e,, 


T,„r'^i + ^i + ^2' 


r„j^ = T„ + 61-62. 


Ti^^^Ti-e^ + e^, 


whence 




r«j -1- 2^«,y ^«„ + 2^„i,i 


Tui + Tu^^ -\- r„jjj -i- T„j^ 


^» 2 2 


4 


and a similar expression holds for Tj; 


and 


2ei = T„jj — T„j = T„jy — Tujjj 


= Ti^ - T,j, = T,j„ - Tjj^ , 


2e2 = r„j - r„j|j = T„jj - r„,^ 


^^^ui~ ^h = ^hy~ ^hi- 



* G. E. Mailer, Die Geaichtspunkte und die Tatsachen der psychophysischen Methodile, 
1904, pp. 67, 71. 

B. & T. 4 
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The errors due to expectation, habituation, fatigue, etc., are neu- 
tralised or at least reduced to a minimum by determining the limen by 
means of both ascending and descending values of V and averaging, and 
by other special precautions in applying the method. 

The Mathematics of the Method of Limits 

It is to Professor F. M. Urban that we owe the most complete dis- 
cussion of the mathematical foundations of this method*. 
Let the variable stimulus F have the values 



''I) "2' °3' 



... s„ 



Then for constant experimental conditions th^re will be. Prof. Urban 
assumes, for each stimulus a probability that a certain judgment (say 
Greater) will be given. Let these probabilities be 

Pi, Pi, Pa,- Pn- 
If the stimuh were arranged in order of magnitude beginning with the 
smallest, then these probabilities will also be so arranged. Let us 
further write q for the probabiUty that this judgment will not be given. 
Then for each q and p we have 

1 - P = ?• 
Consider ascents first. A stimulus s is noted as a just perceptible 
point if the answer greater is given at s and was not given at any lower 
point in that series. This is a compoxmd event, and its probabihty is 
the product of a number of q's for the lower stimuli where greater was not 
the answer, and one f for the%timulus s itself where the answer greater 
is returned.- Let P represent the probabiUty that s will be noted as one 
reading of the just perceptible point, then we have the set of equations 

Pi = Pi, 

Pi = g'iZ'2. 

^3 = MiPa, 



Pn = qi<l2--9.n-lPn, 

and the mean of the just perceptible points will be 

T = s^P^ + s^P^ + S3P3 + ... + s„P„ = S (sP). 

The standard deviation will be given by 

> = iS {{s - Tf P}^S (s^P) - T^- 

* See "Die psyohophysischen MasBmethoden," Archivf. d. gea. Paychologie, 1909, xv. 
p. 289; "On the Method o£ Just Perceptible Differences," Psychol. Sev. 1907, xiv. p. 244; 
The Application of Statisticai Methods to the Problems of Psychophyaica, Philadelphia, 1903. 
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An example will make the whole of this much clearer. Weights of 
84, 88, 92, 96, 100, 104 and 108 grams were compared, by lifting, with 
a standard weight of 100 grams, and the replies given, which referred 
to the variable weight, were heavier, eqiial and lighter. They 
recorded as follows: 



were 



84 88 92 96 100 104 



108 
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h 


h 


e 


h 


h 
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e 


h 


h 


h 


h 
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1 


h 


h 


h 


h 




1 


1 


e 


e 


h 


h 




1 


e 


h 


h 


h 


h 



and so on for 400 rows, the letters h, e and 1 being the initial letters of 
the answers given. The five rows shown in extenso give five readings of 
the just perceptibly heavier point, viz. 92, 88, 96, 104 and 96 grams. The 
400 of these obtained from the 400 rows were distributed as follows 
(Urban's Subject I): 

Grams 84 88 92 96 100 104 108 

Frequency 7 30 76 106 169 12 400 in all. 

The mean of these points is 100-36, which is therefore the directly 
observed threshold of just perceptible positive difference. (It should be 
noted that a time error is included which accounts for the nearness of 
this point to the standard.) The standard deviation is 4-35 grams. 

The frequency with which the answer heavier was given at each 
stimulus can also be found from the records. The application of Urban's 
Formula, based on these frequencies p, is then as follows : 



Grams 


s 


P 


2 


q products 


P 


Ps 


Ps' 


84 


-4 


•0022 


•9978 


•9978 


•0022 


-•0088 


•0352 


88 


-3 


■0200 


•9800 


•9778 


•0200 


-•0600 
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92 


-2 


•0889 


•9111 


•8909 


•0869 


- ^1738 
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96 


-1 


•2222 


•7778 


•6929 


•1980 


-•1980 


•1980 


100 





•4133 


•5867 


•4065 


•2864 






104 
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•8956 


•1044 


•0424 


•3641 


•3641 


•3641 


108 
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•9400 


•0600 


•0025 


•0399 


•0798 


•1596 


112 
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•0025 


•0075 


•0225 
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1^0000 


•4514 


13070 




grams per wor 


dng unit 4 




-•4406 


•0001 =J'2 
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^=•0108 
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100^0432 




•0432 
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ongm 


100 
1000432 


4^56 grama— o- 



See explanation in text to follow. 
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The calculated point of just perceptible positive difference is therefore 
100-04 grams and the standard deviation 4'56 grams. It will be seen 
that a working origin has been taken at 100 grams, and a working unit 
equal to i grams, to simplify the arithmetic. The coliunn headed 
q froduds is the continued product of the q's, from the 84 end. The 
P column is most quickly formed as the differences of successive 
numbers of the product column. The P's rise to a maximum and then 
sink again. They represent in fact the cocked hat distribution of the 
just perceptibly heavier points, and should be compared with the actual 
distribution of the latter. The values P really give the distribution which 
would be found were an infinite number of ascents to be made, the 
'probabilities of the answer heavier at the different points remairdng throvgh- 
out constant at the actual frequencies which are found in these 400 ascents. 

It will be noticed that the P's do not add up to unity imless the 
amount 0-0025 is included. This quantity gives the probability that 
an ascent should be made without obtaining any answer heavier. In the 
example these "just perceptibly greater" points are centred at the 
weight 112, since it is necessary to make some assumption as to their 
position. Strangely enough, Professor Urban himself omits them in his 
calculations, causing in some cases quite an appreciable error. This 
difficulty about the "tail" of the P distribution would not have been 
necessary had the experiments been extended to a point where all the 
answers were heavier. The peculiar difficulty about this is however that 
the psychological conditions would thereby be changed*. The "tail" 
difficulty will therefore always be present in threshold measurements. 
It causes mathematical troubles in all the methods except only in the 
process of calculation used in the Constant Method, and will be fre- 
quently referred to in this and the succeeding chapters. 

If the steps between the stimuli are equal, as here, a saving in 
arithmetic which escaped Professor Urban's notice can be effected by 
using the summation method described on page 19. The sum of the 
q products gives the distance of the threshold from the end stimulus. 
If the equal increments between the stimuU are not unity but x, the 
sum of the q products must first be multiphed by x. The standard 
deviation can also be obtained by a further application of the summation 
process; the details are left to the reader who should consult the next 
chapter. The consideration of this point here might lead us too far 
from the main argument. • 

* See inter alia the article "On Judgments of Like," Frank Angell, Am. Jowm, 
Psychol. 1907, xvm. p. 253. 
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In the same experiment the points of just not perceptible difEerence 
were distributed as follows: 



Grams 
Frequency 



84 




88 
8 



92 
36 



99 



100 
200 



104 
35 



108 
22 



400 in all. 



Of these the mean is 98-84 grams, and the standard deviation 
4-02 grams. The application of Urban's Formula in this case gives the 
following results : 
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The mean threshold is therefore 



by the direct method 
by Urban's Formula 



(100^36 + 98^84)/2 =99^60 grams, 
(100^04 + 99^14)/2=99^59 grams. 



Professor Urban's own calculated values are sUghtly different, partly 
because he neglects the tail of the distribution as above explained, 
partly because of an arithmetical error. 

The practical applications of Urban's Formula are few, though it is 
occasionally useful in cases where the direct observation of the just 
perceptible points is impracticable*. Its great value lies in the con- 
clusions which it enables us to draw. In the first place it shows clearly 
that the order in which the stimuH occur has no effect on the value of 
the threshold from a mathematical point of view, for the sum of the 
quantities sP is independent of their order, and the P's themselves are 
equally independent thereof. This does not of course mean that a change 
of order of the stimuU will have no psychological effect. There is no 
doubt that the effect of a non-consecutive series of stimuli will be very 



» See e.g. G. H. Thomson, "Changes in the Spatial Threshold during a Sitting," 
Brit. Journ. Psychol. 1914, vi. p. 438. 
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different from that of an ascending or descending series, inasmuch as 
the latter will arouse feelings of expectation and the like. Professor 
Urban therefore recommends that except when it is specially desired to 
study the results of ascents or descents the stimuli should be arranged 
in non-consecutive order. In fact it will be seen that the above described 
simple mathematical process can also be applied to data collected by the 
Method of Eight and Wrong Cases. We have here an example of the 
distinction between methods of collecting data on the one hand and 
processes of calculation on the other. The Limiting Process of calcula- 
tion can be applied to the Constant Method of collecting data. 

Professor Urban however, although recommending non-consecutive 
stimuli, does not recommend that the data should be collected in the 
way usual in the Constant Method if it is intended from the outset to 
apply the Limiting Process. On the contrary, instead of keeping the 
stimuli constant throughout the experiment, he advises a frequent 
change of stimulus-series. For he found, from consideration of the 
results of interpolating and of irregularly spacing stimuli, that the final 
values obtained by the Limiting Process are dependent upon the par- 
ticular stimuli used, in a mathematical way. To free results from this 
bias as far as possible, he recommends the following procedure. The 
steps in any one series should'be of equal size, and the starting point 
of the steps should be altered from series to series until the whole area 
has been covered. For example, if the spatial threshold were being 
investigated, the stimuli used might be* 

First series 0, 1, 2, 3, 4, 5, 6, 7, 8 centimetres; 
second series 01, 1-1, 2-1, 3-1, 4-1, 5-1, 61, 7-1, 8-1; 
third „ 0-2, 1-2, 2-2, 3-2, 4-2, 5-2, 6-2, 7-2, 8-2; 



tenth „ 0-9, 1-9, 2-9, 3-9, 4-9, 5-9, 6-9, 7-9, 8-9. 

The series might of course follow each other in any order. When this 
is done, Professor Urban shows that the final limen arrived at is the 
point where the probabihty of the response in question is one-half. 
That is, the Limiting Method finds the median thresholdf . 

A modification of the Method of Limits, which effects a more com- 
plete elimination of the expectation error by the use of "catch experi- 
ments," where the variable given is equal to the standard, and which 

* See G. H. Thomson, "Changes in the Spatial Threshold during a Sitting," Brit. 
Journ. Psychol. 1914, vi. pp. 435 — 6, where this plan is followed. 

t For other discussions of the mathematics of the Limiting Process and allied subjects 
the reader may consult articles by Wirth, Urban, Lipps in the Archivf. d. ges. Paychohgie, 
etc. 
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also collects data at a quicker rate than the ordinary Method of Limits, 
is that known as the Method of Serial Growps*. 

Each fixed value of the variable stimulus is presented with the' 
standard ten times, not in immediate succession, but interspersed at 
random among ten other values of F equal to S. The percentage of 
correct answers given by the subject is noted, and the experimenter 
passes on to the next value of F which is presented along with catch 
stimuli in a similar manner. The value of F which, as presented in this 
way, gives 80 % right answers, is arbitrarily chosen as measuring the 
limen. Of course this choice of 80 % makes the method measure a 
totally different point from the 50 % point measured by the Method of 
Limits, but this is not essential to the method as a method of collecting 
dataf. As such it is a very convenient one to use in measuring a large 
number of subjects in "mental test" experiments, or with primitive 
people, where economisation of time is essential. 

The mathematical theory of this method is similar to that of the 
Method of Limitsf . It has been shown that mathematically the Method 
of Limits is superior, and that the mathematical disadvantages increase 
with the size of group taken. For this reason groups of four have been 
suggested instead of ten §. Further, the arbitrary use of the 80 % point 
obscures comparison with the results of other methods. The 50 % point 
would be better although it does not allow so large a number of subjects 
to be measured in a short time as does the 80 % point, if, as is assumed, 
each descent is stopped as soon as the required point is reached, and 
a new descent begun. With groups of four, the 75 %, 50 % and 
25 % points can all be noted if time permits of complete descents 
and ascents (thus giving both the median limen and a measure of 
scatter, the interquartile range), while if time pressed, the 75 % point 
would be sufficiently comparable with the 80 % point of previous 
experiments. 

As in the Method of Limits, the mathematical foundations are 
unaltered if the groups cease to be in serial order. We thus arrive at 

* See Text Book of Experimental Psychology, C. S. Myers, Cambridge, 1911, p. 196. 

t See G. M. Stratton {Psychol. Review, 1902, ix. pp. 444 — 447), who says "a detafl 
like this, as well as the exact number of experiments that may best form a group, might 
well be considered as subject to revision in the light of further experience and not as an 
essential part of the method." 

X 6. H. Thomson, "A Comparison of Psychophysical Methods," Brit. Jornn. Piychol. 

1912, V. p. 212. 

§ "An Inquiry into the Best Form of the Method of Serial Groups," Brit. Joum. 
Psychol. 1913, v. pp. 398—416; and "The Probable Error of Urban's Formula," ibid. 

1913, VI. pp. 217—222. 
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a Method of Non-Consecutive Groups*, which has been held by one 
experimenter to be the best method of collecting data. 

It may be pointed out in passing that the Method of Groups is one 
which is naturally employed in other branches of mental measurement 
as well as in psychophysics. For example, the widely known Binet Tests 
are given in groups beginning at a group designed to suit a child of very 
young age, and are proceeded with until a certain percentage of passes 
is obtained at some group. Usually however modifications in the form 
of marks for tests passed above the critical group are admitted. There 
is no doubt but that the mathematical foundations of many of these 
devices require examination in the light of the theory of probabiUtyf . 

It may also be pointed out that as methods of collecting data the 
Method of Serial Groups, and that of Non-Consecutive Groups, especially 
the latter, approach the principle of the constant method yet to be 
described, and indeed their data can very well be handled by the pro- 
cesses in use in the latter method. 

(3) THE METHOD OF AVERAGE ERROR 

In this method the subject is required to adjust a variable stimulus 
so that it seems subjectively equal to a given standard stimulus. In this 
it differs considerably from the other methods in which the experimenter 
does the adjustment, usually out of sight or even before the sitting 
starts, and the subject expresses an opinion on each stimulus but does 
not alter it. The alternative name of the present method, viz. the Method 
of Production, suitably emphasises this important psychological point. 

Mathematically the method presents differences which are mainly 
due to the fact that in this method almost any value of the variable 
stimulus can crop up, whereas in the other methods the experimenter 
customarily keeps to certain steps, so that the results are as it were 
heaped up at certain points. 

The experiment is repeated a large number of times — at least 100 — 
and the arithmetical mean of all the obtained stimulus-values is calcu- 
lated. The difference between this mean value and the standard stimulus 
is known as the crude constant error e. It may be either positive or 
negative. A measure of scatter is also found. 

The crude constant error may be partly due to a space error (the 
time error cannot occur in this method), partly to other constant 

* Thomson, Brit. Journ. Psychol. 1912, v. p. 205. See also Brit. Joum. Psychol. 1914, 
VI. p. 434, where this method was employed. 

t See Francis N. Maxfield, "Some Mathematical Aspects of the Binet-Simon Tests," 
Journal of Educational Psychology, 1918, ix. pp. 1 — 12. 
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conditions*. Let us assume that the experiment is to adjust the length 
of a variable line until it seems to be equal in length to a standard liue. 
In this case, the standard should be situated to the right in one-half 
the number of adjustments, and to the left in the other half, either 
alternately or in haphazard order. The results are tabulated in two 
columns, I standard to the right, II standard to the left, and the means 
of these two found separately. Half the difference of these means is 
the space error, while half their sum, less the standard, gives the residual 
constant error. 

In order to give as much definiteness as possible to the task of 
adjustment, the variable should start sometimes shorter than the 
standard, sometimes (an equal number) longer, and the adjustment be 
made by lengthening or shortening respectively. Again, the requisite 
amount of shortening or lengthening should be arranged to be different 
on different occasions, but alternating with some degree of regularity. 

The value of the scatter obtained in this method is from a general 
point of view a more important result than the value of the constant 
errors, since it has often been regarded as proportional to the value of 
the difference threshold as determined by the other two psychophysical 
methods. The truth is that although there is a certain amount of 
proportionality between the values, this proportionality is not complete. 
Under certain conditions the two values vary in opposite directionsf. 
It is hardly necessary to point out that the scatter of the individual points 
obtained in the Method of Limits has but little relation to that reached 
by the Average Error Method. They are not entirely unrelated, however. 

The closest correspondence of any is that between the distribution 
of the errors in the present method and that of the judgments "equal" 
in the Constant Method. 

(4) THE CONSTANT METHOD 

This is generally regarded as the most satisfactory of the psycho- 
physical methods. It can be employed with equal convenience for the 
determination of absolute thresholds, difference thresholds, equal- 
appearing sense-distances, and other measurements of psychological 
importance. The different values of the variable stimulus to be employed 
are fixed once for all at the beginning of the investigation, and are 
presented to the subject a large number of times (say 100 applications 

* Cf. E. B. Titchener, "Experimental Psychology," Student's Manual, n. p. 74: "A 
constant error is simply an error whose conditions are constant; its aTrumnt may vary, 
quite considerably, from stage to stage of a Jong series of experiments." 

t This question is connected with the point discussed on p. 75 et leq. 
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of each) in irregular order, or in a prearranged order, unknown to the 
subject, corresponding to certain precautions. If an absolute threshold 
is being determined, the variable is presented alone; if a difEerence 
threshold, it is on each occasion preceded, acconapanied, or followed by 
the standard. In the latter case, the subject returns the replies greater, 
uncertain or equal, less, with reference either to the standard, or to the 
variable, or to the first presented stimulus, or on some other prearranged 
plan. The percentage of each of these three types of answers is deter- 
mined for each value of the variable used, and recorded (it is also 
advisable to retain the raw data in the form of the actual answers in 
the order given). 

As one illustration of this method, we shall find it convenient to 
refer to a series of results obtained by Riecker [Zeitschrift f. Biologic, 
Bd. X.) which has already served in the descriptions of the method 
given by Miiller, Titchener, and Myers. Riecker obtained the following 
results in an investigation of the "spatial threshold," or threshold of 
two-point judgments of the skin of the lower eyeUd: 

s (distance between the points of 

the aesthesiometer in Paris . 0-5 1 1-5 2 3 4 5 6 

lines)* 
lOOp (% of two-point judgments) 30 10 14 40 65 80 87 96 100 

30 -20 4 26 25 16 7 9 4 

It will be observed that, with two exceptions, the series of per- 
centages follows a general law of increase, the rate of increase itself 
increasing at first and then diminishing. The numbers in the lowest 
line are formed by subtracting each percentage from the immediately 
succeeding one, and show this uniformity more clearly. The two excep- 
tions are at s = and .s = 5. At s = the percentage is greater than at 
the immediately succeeding stimulus. This irregularity is known as an 
inversion of the first order. The cause of it is doubtless to be looked for 
in the exceptional way in which the stimulus (one point) may have been 
applied, the pressiire and general nature of the contact may have been 
different from what they were with two-point contact, or some mis- 
leading suggestion may have accompanied these particular experiments. 

At s = 5, the percentage is indeed larger than that for s = 4 and 
smaller than that for « = 6, but reference to the differences shows that 
the increase from 4 to 5 is greater than the increase from 3 to 4, thus 
breaking the general rule as regards rate of increase. This irregularity 
is known as an in/oersion of the second order, and being in the present 

* A Fans line =2-25 mms. 
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case slight, is probably to be explained as due to an insufficiency in the 

number of applications of the stimulus. 

As a second example, in this case one dealing with a difference 

threshold, we shall take the data for lifted weights which have already 

been used on p. 51 in connection with the Method of Limits. The 

standard weight was 100 grams and was lifted before each of the seven 

comparison weights. The judgments given were lighter than, eqval to, 

or heavier than the standard. With Subject I the answers heavier were 

distributed as follows, in 450 trials with each weight: 

Comparison weight 4 84 88 92 96 100 104 

Answers Aeawer 1 9 40 100 186 403 

Proportions p -0022 -0200 -0889 -2222 -4133 -8956 

The third row of this table is simply the second row divided by 450*. 

A figure of the quantities p forms a curve which, following Galton, we 
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Fig. 7. Urban's Subject I. ^=proportion of answers Acot«er 

may describe as an ogive. It is shown in Fig. 7, where however no 
attempt is made to draw a smooth curve through the points, which are 
merely joined by straight lines. 

Here there are no inversions of either order, and the example is 
therefore a more convenient one for a first explanation of the various 
processes of calculation which may be adopted. We shall for this reason 
deal with it first, returning later and considering Riecker's more irregular 
numbers. 

We must commence by defining the limen, and do so in the first 

* This figure is correct. But (as stated on page 51) Urban only gives 400 values of 
the just perceptibly heavier points : we are unaware of the reason which led him to discard 
the other SO. 
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place as the point corresponding to 50 % of the judgments in question. 
For example, in our case we define the limen for the response heavier 
as being the point where the subject would give half his answers heavier, 
the other half being lighter, equal, or whatever other answer is allowed, 
but not heavier. Above this point he is more likely to answer heavier 
than not, below it he is more likely not to give this reply. 

The limen corresponding to 50 % heavier judgments may be calcu- 
lated in two general ways: (A) from the observed values, (B) by finding 
the best fitting smooth curve, adjusting the observations to this, and 
then calculating the constants (mean, mode, scatter, etc.) from the curve. 

A (1). Linear Interpolation 

The required limen obviously falls somewhere between 100 grams 
(41'33 % answers heavier) and 104 grams (89-56 % answers heavier). 
A very simple way therefore of determining its value is to assume these 
points joined by a straight line, as in Fig. 7, and find where this line 
cuts the 50 % line. Arithmetically this means finding a weight which 
divides the interval from 100 to 104 grams in the same proportion as 
50 % divides the interval between 41-33 % and 89-56 %. If T be this 
weight, we have then 

T - 100 _ 50 - 41-33 
104 -T~ 89-56 -50' 
and with practically no calculation we obtain the value 100-72 grams 
for the limen. 

This method though commendably simple is open to several ob- 
jections: 

(1) It does not employ all the data ; it uses two of the percentages 
only. 

(2) The assumption that the curve is a straight line at this point 
is unlikely to be exact. 

(3) It gives no measure of scatter. 

A very simple extension of the above idea has been suggested* 

which obviates the third and to some extent the first of these objections. 

This is to calculate the 75 % and the 25 % points by the same simple 

form of Unear interpolation as that employed for the limen itself, that 

is, on the Fig. 7, to find where the zig-zag ogive crosses the -25 and 

•75 lines. These distances are readily found by a short calculation 

to be 96-58 and 102-79 grams respectively. Half the interval between 

* 6. H. Thomson, "A Comparison of Psychophysical Methods," Brit. Journ. Psychol. 
1912, V. p. 210 footnote. 
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them, namely 3-10 grams, is the semi-interquartile range, a rough but 
very practical measure of scatter. 

The fact that the 50 % point is not half-way between the 25 % point 
and the 75 % point is a rough indication of the skewness of the data: 
it divides the interquartile range in the proportion 4-14 grams below 
and 2-07 grams above. 

Moreover, it will be found that in practice the mean of the three 
values, the 25 %, 50 %, and 75 % points, gives a fairly good approxi- 
mation to the threshold found by more compHcated calculations assuming 
a symmetrical distribution. In the present case this gives 

96-58 + 1 00-72 + 102-79 , . . _ „ 

= 100-03 grams. 

o 

A (2). The Arithmetical Mean* (Spearman's Formula) 

A value which can be calculated by a use of all the data is that of 
the mean or average limen. Before proceeding to consider this, it is 
important to realise clearly the fact, which 6. E. Mullerf was the first 
to point out and emphasise, that a Umen is a variable magnitude 
following a law of frequency-distribution. There is no fixed limen, only 
an average limen, a most frequent limen, or a most representative limen. 
The p's in the table (p. 59) represent the relative frequency of limina 
for stimuli below the corresponding s. Thus there were 41-33 % limina 
below 100 grams, and 22-22 % limina below 96 grams; i.e. there were 
41-33 — 22-22 or 19-11 % limina between the Umits of stimulus-values 
96 grams and 100 grams. This suggests the plan of plotting a frequency- 
polygon or histogram for the Umina using these differences of the p's. 
For the present case the table of differences runs: 



Below 84 grams 


1, or 


•0022 


84^88 




8 „ 


•0178 


88—92 




31 „ 


•0689 


92—96 




60 „ 


•1333 


96—100 




86 „ 


•1911 


100—104 




217 „ 


•4823 


104^108 




20 „ 


•0444 


Above 108 




27t „ 


•0600 



* "The Method of Bight and Wrong Cases without Gauss's Formulas," Brit. Journ, 
Psychol. 1908, n. pp. 227—242. 

f This at least is the view to which Miiller himself inclines {GesicMspunkte, p. 59), 
but he deduces his formulae on the assumption, supported by Fechner and Bruns, that 
the threshold has a single definite value, subject in the course of an experimental deter- 
mination to variable apparent increase or decrease by random influences which obey the 
Normal Law of error. He points out that both views lead to the same formulae. Indeed 
the distinction is merely a verbal one, in our opinion. 

t This is not necessarily an inversion of the second order, for it may be spread out 
to any distance above 108 grams. 
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and the histogram, or pseudo-histogram* as it is safer to call it, is given 
in Fig. 8. 

If we now wish to proceed to the calculation of the mean of these 
limina we must decide where to centre those which lie between say 
92 and 96 grams. It is well known that this centre must be taken in 
the actual geometrical centre of each intervalf, and Professor Spearman's 
suggestion to use not the mid-ordinate, but an ordinate shifted slightly 
towards the higher of the two neighbouring columns, is unnecessary. 
The unadjusted value is the best value of the mean. The source of 
Professor Spearman's error lies in the fact that in his discussion he 
replaces each rectangle of the pseudo-histogram by a trapezium of 
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Fig. 8. Pseudo-histogram of the 4S0 limina. Urban's Subject I, answers heavier 



different area, and then multiplies the centroid abscissa of this new area 
by the old area, which could not possibly have the centroid of the new 
one. 

A real difficulty, and one which robs the method of much of the 
practical utility which it would otherwise have, is the impossibilUy of 
fixing in any unambiguous fashion the centres of the tails of the distribution, 
that is those limina which in our example lie below 84 and above 108 
grams respectively. This is the same trouble as that referred to on 
p. 52 in connection with the application of Urban's Formula. 

* See later, p. 79. 

t Cf. e.g. W. F. Sheppard, "On the Calculation of the most probable Values of Fre- 
quency Constants," Proc Land. Math. Soc. xxix. pp. 353 £E. 
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The simplest thing to do is to assume that at 80 grams the subject 
would never, and at 112 grams would always, have answered heavier, 
i.e. to centre the tails at 82 and at 110 grams respectively. We shall 
do so in this case, although it is clear that at the upper end the tail is 
probably more spread out than this. We then have the following calcu- 
lation for the mean or average hmen : 



1 at 


82= 82 


8 „ 


86= 688 


31 „ 


90= 2790 


60 „ 


94= 5640 


86 „ 


98= 8428 


217 „ 


102=22134 


20 „ 


106= 2120 


27 „ 


110= 2970 




44852 total 



Dividing this by the whole number of hmina, 450, we obtain 99-67 grams 
as the threshold by this method. The standard deviation can also be 
foimd, CT = 5-1 grams (but see next chapter for a correction to this 
value, known as Shejypard's adjustment). 

The summation method of finding the mean, which is explained on 
p. 19, again enables a saving in arithmetic to be effected in this calcula- 
tion. Apphed directly to the frequencies p it works as follows in the 
present case, and obviates the actual formation of the pseudo-histogram. 

Frequency of answers heavier at 



ivier at 84 


grams 


-0022 


88 


>» 


-0200 


92 


„ 


■0889 


96 


» 


-2222 


100 


It 


-4133 


104 


„ 


-8956 


108 


»» 


-9400 
2-5822 sum 

4 gram units 



10-3288 
This must be sabtracted from origin 110 

99-6712 grama threshold. 

The reader must be referred to a study of the summation method on 
p. 19 to realise why in this case the sum obtained has to be subtracted 
from the origin. 

We can at this point call attention to an instructive comparison 
which can be drawn between the Method of Limits and the Method of 
Right and Wrong Cases, as regards the processes of calculation employed 
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in them, on the basis of Urban's Formula for the former and Spearman's 
Formula* for the latterf. 

Urban's Formula, S (sP), shows the threshold as the mean of a 
number of just perceptible points, which are centred at the stimuli used, 
the quantities P being the differences of successive products of the 
frequencies of the answers heavier (or not heavier). 

Spearman's Formula, as exemplified in the above calculations, can 
be written S (s . dp), where dp is the difference of the successive fre- 
quencies p, and shows the threshold as the mean of a number of limina 
which lie between the stimuh used. 

B. Methods which fit smooth curves to the data 

The idea underlying these methods is to run a smooth curve through 
the points of Fig. 7 (the p values) instead of simply joining them by 
a zigzag, and then to ascertain where this smooth curve, which does 
not necessarily go exactly through any of the points but smooths off 
any inequaUties, passes the 50 % line. Any suitable curve which 
happened to occur to one might of course be employed. For example, 
a parabola of high order can be usedj, and the curve tan~^ 6 has also 
been tried §. But clearly the whole experiment suggests that an error 
function of some sort is wanted, and as early as 1860 G. T. Fechner 
suggested 1 1 that such numbers formed the integral of a Normal Curve 
of Error. 

This idea would naturally occur to anyone accustomed to handUng 
the Normal Curve on considering the pseudo-histogram or table of 
differences. Fig. 8. The obvious skewness of the diagram would also 
strike such an observer, it may be noted in passing, but with this point 
we shall deal in a separate chapter. 

The obvious way to fit such a histogram with a Normal Curve is 
that given in the previous chapter, namely, to find the mean and the 
standard deviation, and use these constants in the expression for the 
curve. To this there are however important practical objections, the 
chief being the difficulty of the undefined tails, to which reference has 

* We use Professor Spearman's name for this plan since he was the first English 
writer, we believe, to call attention to it: but we do not include the erroneous point in 
his calculation to which reference has already been made. 

t G. H. Thomson, "A Comparison of Psychophysical Methods," Brit. Journ. Psychol. 
1912, V. p. 226. 

X F. H. Urban, "Die psychophysischen Massmethoden als Grundlagen empirischer 
Messungen," Archiv f. d. ges. Psychologie, 1909, XV. and xvi. pp. 335 — 355. 

§ Urban, loc. cit. p. 393 ef eeq. 

II G. T. Fechner, Elemente der Paychophysik, 1860. 
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already been made in discussing Spearman's plan of finding the mean. 
This difficulty is still more acute when we attempt to find the standard 
deviation. This tail difficulty does not occur in the plan adopted by 
Mtiller*, which fits the Normal Integral direct to the p values, not 
troubling at all about the differences forming the histogram. In other 
words, Miiller's process fits a curve to Fig. 7, not to Fig. 8, and is 
therefore more direct, in addition to the advantage it has of avoiding 
the tail problem. 

Before proceeding to the explanation of Miiller's process, it is 
necessary to notice that he employed a slightly different form of the 
equation to a Normal Curve from that used in the previous chapter. 
The latter is the form now in general use among biometiicians, and it 
seems desirable that it should also be used by psychometricians, who 
otherwise would be hindered from the direct application to their work 
of the mathematical improvements made by the biometric school, and 
especially would find their use of the valuable tables published by 
Professor Pearson much hampered. In the actual description of Miiller's 
work however it is better for the present to keep to his notation in this 
respect. The form of the Normal Curve used by him was 

In this s is the variable stimulus, T is the average limen, and therefore 
also, since the curve is symmetrical, the median and the mode. So far 
the notation agrees with that used in the expression employed for the 
Normal Curve in the previous chapter, namely 

but a further comparison of the two shows that for his second constant 
Miiller has a quantity h, which is connected with the standard devia- 
tion a of the biometric formula by the relationship 

When the standard deviation is large, therefore, h is small. It is a 

measure of precision, not of scatter. 

The assumption is now made that the relationship between the 

* 6. E. Miiller, "Ueber die Maassbestimmungen des OrtssiimeB der Haut mittels der 
Methode der richtigen und falsoheu FaUe," Pfluger'a Archiv fur die ges. Physiohgie, 1879, 
xrx. pp. 191 — 235, especially par. 5 et seq.: also Die Oesichtspunkte und die Tatsachen 
der psychophysischen Methodik, Wiesbaden, 1904, par. 11, where the classical description 
of this method will be found. 

B. & T. 5 
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stimulus s, and the frequency p with which the answer heavier is returned, 
is given by the equation 

p-r''-A-e-h'i^-T)'ds = (1). 

J -00 V{^) 

That is to say, it is assumed that the successively increasing percentages 
of answers heavier correspond to the increasing area of the portion of 
a Normal Curve which is shaded in Fig. 9, as the point s moves to 
the right in that figure. 

To obtain this equation in a more convenient form for our purpose, 

^'^^ h{s-T) = t (2). 

This corresponds to measuring the stimuli in a special unit, and is the 




T 

Fig. 9. To illustrate MiiUer's process 

same device as that used in another connection later by Galton. The 

equation then becomes 

1 |-ft(s-r) 

^-v(^)L ^"*=^ (^)- 

By inserting in this equation the corresponding values of p and s 
from the table of data on p. 59, we obtain seven equations for two 
unknowns h and T, and as these equations are shghtly inconsistent with 
one another we have to decide how to calculate the most probable 
values of h and T. No pair of values will exactly satisfy all seven 
equations. Instead of coming to zero they leave small residuals v. 
Miiller adopted the Method of Least Squares, an account of which will 
be found on p. 44 in Chapter II. 

In passing a note must be made of the fact that Miiller assumed 
tacitly that these observation equations, being each based on the same 
nimiber of experiments, are of equal importance or " weight*." We shall 
allow this assumption to pass for the present but shall return to it later. 

Unfortunately, the equations are very far from being simple and 

* There is unfortunately a possibility of ambiguity here owing to the fact that 
weights are used as stimuli. 
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linear as in the example on pages 44 and 45. To avoid this difficulty, 
we look up in tables of the Probability Integral* those values of 

V=^h{s-T) (4) 

which correspond exactly to our values of p. 

These equations are not yet linear in T and h, though much simpler. 
If however we write c — hT 

they become y — hs + c = (5)f 

and are now linear in h and c. If we now insert any pair of values 
h and c into these seven (or in general, n) equations, these also will 
leave residuals u, different from those considered above in connection 
with the equations (3). If we were now to proceed to make S (u^) 
a minimum, this would not effect our purpose. It is S (v^) we wish to 
make a minimum, not S (u^). If however we can find multipliers or 
"weights" M such that each 

Mw^ = v\ 
then we can make S (Mu^) • 

a minimum. That is, we can apply Least Squares to the equations (5) 
weighted with certain artificial weights (in addition to any weights 
which may possibly be necessary by reason of there being different 
numbers of experiments at the different stimulus-values). The use of 
this device of artificial weights to overcome the complexity due to the 
non-linear equations is Miiller's particular credit in this connection. 

Clearly the residuals v, which may be regarded as errors in f, are 
connected with the residuals u, which may be regarded as errors in y, 

by the equation \ 

__- e-y' u = v 
V('r) 

from equations (3) and (4). Therefore 

M = e-^y'/n. 

Herein we can omit the tt since it is only the relative values of the 

Miiller weights which are of importance. These weights are, by reason 

of the improved weights to be shortly described, of only historical 

interest. The condition that S (v^) should be a minimum has now 

* The table of the Probability Integral commonly used by psychologists is that known 
as Feohner's Fundamental Table, which is given in Appendix I. It is desirable that 
psychologists should make the slight changes in notation necessary to enable them to 
use better and more generally accessible tables. For example, the first table in Pearson's 
Tables for Biometricians and Statisticians is, except for a factor y'2, identical in significance 
with Fechner's, and gives more values, to more decimal places. 

f Note that this, like (3), represents a set of equations, each of this form. In our 
example there are seven such equations. 

5—2 
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become that S (Mu^) should be a minimum. With this substitution, 
the equations (5) give the two Normal Equations 

S (Msy) - S {M^) h + S (Ms) c = ) . 

S{My) + S{Ms)h-S{M)c==0 j ^'*'- 

Thence we have 

S (Ms) S {Msy) - S (My) S (Ms^) ] 



c = 



S (M) 8 (Ms^) - S (Ms)^ 
M) S {Msy) - S (Ms) S (. 
S (M) S (Ms^) - S (Msf 



^ _ S (M) S (Msy) - S (Ms) S (My) 



.(7). 



The use of these formulae is best explained by an example. Before 
giving one, however, it is well to describe a modification in the weights M 
which was introduced in 1909 by Professor F. M. Urban*, as it is with 
Urban's weights, and not with Miiller's, that we shall actually work. 

These alterations in the Miiller weights, or rather additions to 
Miiller's weights, which Professor Urban made, arise from the notion 
of the probabiHty of a certain judgment, with which we are already 
familiar. The analogy is, as it were, between extracting the answers 
heavier or lighter from a subject, and extracting black or white balls 
from a bag containing a mixture of these colours. Compare, for example, 
the two statements: 

(1) From a bag containing black balls and white balls, 450 drawings 
are made, one at a time, the ball being returned each time before the 
next drawing is made. 403 black balls are observed out of the 450. 

(2) A subject on performing a certain experiment with lifted weights 
sometimes gives the answer heavier, sometimes some other answer. On 
one occasion, when the weights were 100 grams standard and 104 grams 
unknown, this experiment was repeated 450 times, and the answer 
heavier was obtained 403 times out of the 450. 

Now if p is the observed proportion (here 403/450) of black balls in 
a bag, then the probable error of p is known to vary with Vp (1 — p), 
or Vpq'f. With the same sized sample, a result p = -5 has a larger 
probable error than a result p = -8 say. If anything similar holds, as the 
analogy suggests, for the psychometric experiment, then the seven, or n, 

* "Die psychophysisohen Massmethoden," Archiv f. d. ges. Paychol. 1909, XV. and xvi. 
p. 357 et seq. 

•f Cf. p. 32, Standard Deviation of the Binomial Expansion. Really the true values 
of p and q should be used, but this is the best we can do. And further, the expiesdon 
probable error ceases to have an accurate meaning when p is too close to zero or unity 
and the distribution is in consequence very skew. But these refinements do not a&ect 
the argument except in detaiL 
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equations (5) are not equally reliable, even though based on the same 
number, 450, of experiments each. In addition to the Miiller weights M 
they need other weights l/4pg' to allow for this new variation in reliability. 

These weights, it will be observed, arise from the fact that drawing 
balls singly from a bag in this way gives rise to a binomial distribution, 
and the standard deviation of such a distribution is, as is shown on 
p. 32 in the previous chaptet, equal to Vjjg'. The combined weights 
Mjiifq are known as Urban's weights, and are also given in a table in 
Appendix I. Professor Urban discusses the matter at some length in 
his already cited article, and a discussion will also be found in Wirth's 
Psychofhysik (Leipzig, 1912), where on p. 151 the actual scatter of 
various p's is given in a diagram. 

The fact that in the above we have taken the weights as proportional 
inversely to the square of the probable error, fq, need cause the reader 
no trouble, for it is the same phenomenon as the fact that the accuracy 
of a set of readings increases with the square root of the number of 
readings made, as follows from p. 24. The weight of an observation 
equation in an ordinary sense is simply the number of times the observa- 
tion has been repeated, that is it is proportional to n the number of 
observations, which as we have just said is proportional, inversely to 
the square of the probable error. 

Using the symbol W for the combin,ed Urban and Miiller weights 
which are given in tabular form in Appendix I, we have to replace M 
by W in the equations (6). These equations we shall now illustrate 
by giving at some length the calculations indicated by them in the case 
of Urban's Subject I, answers heavier. 

We first look up in Fechner's Fundamental Table*, i.e. a table of the 
probability integral, those values of y which correspond to the observed 
values of p. These are given in the third column of the adjoining table. 
The fourth column of that table gives the values of the Urban-Miiller 
weights W, found from the table in the Appendix. 

Urban' s Subject I, Heavier answers 
Values of 7 and W for subatitution in the normal equations 



Stimulus s grams 


P 


7 


W 


• Ws 


84 


-0022 


-20150 


0^025 


210 


88 


•0200 


-14520 


0187 


1646 


92 


•0889 


-0^9528 


0^502 


4618 


96 


■2222 


- 0^6408 


0^806 


77-38 


100 


•4133 


-01549 


0-982 


9820 


104 


-8956 


0-8888 


0-551 


5730 


108 


•9400 


10993 
* See Appendix I. 


0-396 
3-449 


42-77 
340-39 
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From this table S (W) ia at once obtained, and the other quantities 
appearing in equations (6) are easily formed, though the arithmetic 
is laborious. The work for S (Ws) is given in the last column of the above 
table, and the other quantities are obtained similarly. They prove to be 

S (W) = 3449, 

S{Ws)= 340-39, 

S{Wsy)= -31-223, 

S(Wy)=' - 0-463, 

S (Ws^) = S3700-1. 

The equations (7), reading W instead of M, then give 

Threshold T = 99-68 grams. 

Precision h = 0-136113. 

The standard deviation corresponding to this precision is 

and ■6745ct = 3-5 grams. 

The fact that 99-68, which is thus found for the threshold of answers 
heavier, is actually smaller than the standard 100 grams than which it 
is judged heavier, may cause confusion if it is not at once explained 
that this value involves a time error. 

It will be seen that the Constant Process as described above and as 
illustrated by this example, involves a great deal of arithmetical work, 
so much so indeed that it is certain never to be used except in some 
special cases, unless plans for easing this labour be adopted. Something 
can be done by using the arithmetical short-cuts explained in the pre- 
ceding chapter^ and Crelle's Calculating Tables, or better still a calcu- 
lating machine, makes the work practicable. But the best device for 
reducing the arithmetical work involved in the Constant Process is that 
adopted by Professor Urban in pubUshing his tables for this method. 
These tables are given in Appendix I, and do away with the necessity 
for Fechner's Fundamental Table and the Table of the Miiller-Urban 
weights. They assume that exactly 100 experiments have been made 
at each stimulus-value, but of course if other numbers of experiments 
have been made, the p'a can be approximated to by two significant 
figures. Their use will be readily grasped from the worked example on 
p. 73 below: and that example is also employed in Appendix I to 
explain Kich's useful Checking Table. 
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We give next, as models, the calculations by all the above methods 
for Riecker's data, assuming however that no two-point judgments were 
given at zero stimulus-distance. The data are given on p. 58 and it 
should be noted that the stimuli are not equidistant, and that therefore 
summation methods cannot be used to lessen the arithmetical work. 

Riecker's Data 

(1) Limiting Process. By Urban's Formula 

(a) Just Perceptibly-two Points 



s 


P 


? 


q products 


P 


P« 


P«» 





•00 


100 


1^0000 


■0000 


■0000 


•0000 


0-5 


•10 


•90 


•9000 


•1000 


■0500 


•0250 


1 


•14 


•86 


■7740 


■1260 


■1260 


•1260 


1-5 


•40 


•60 


•4644 


•3096 


■4644 


•6966 


2 


•65 


•35 


•1625 


•3019 


•6038 


1-2076 


3 


•80 


•20 


■0325 


■1300 


•3900 


1-1700 


4 


•87 


•13 


•0042 


■0283 


■1132 


•4528 


6 


•96 


•04 


■0002 


■0040 


■0200 


•1000 


6 


100 


•00 


•0000 


■0002 


•0012 


-0072 


Sums 


10000 


1-7686 


3-7852 



T = sum of Ps = 1-7686 Paris Lines. 
Standard deviation of the just perceptibly-two points, squared, 
= sum of Ps^, less T^ 
= 3-7852 - 3-1279 = -6573, 
whence standard deviation = -81 Paris Lines. 

(b) Just Imperceptibly-two Points 



« 


P 


p products 


P' 


P's 


P's» 





-00 


■0000 


■0024 


•0000 


-0000 


0-5 


-10 


■0024 


■0219 


•0109 


-0055 


1 


-14 


•0243 


■1494 


■1494 


-1494 


1-5 


-40 


•1737 


•2606 


■3909 


•5863 


2 


-65 


•4343 


-2339 


■4678 


-9356 


3 


-80 


■6682 


-1670 


-5010 


1-5030 


4 


-87 


-8352 


-1248 


-4992 


1-9968 


5 


-96 


■9600 


-0400 


-2000 


1-0000 


6 


1-00 


1^0000 


-0000 


-0000 


-0000 


Slims 


1-0000 


2-2192 


6-1766 



whence 



T' = 2-2192 Paris Lines, 
0-' = 1-12 Paris Lines, 
(T + T')/2 = 1^99 Paris Lines, 
Mean of the two as, 0-97 Paris Lines. 
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(2) Linear Interpolation 
for 75 %, 50 % and 25 % points. 



80-75 

75 - 65 ^ 

65-50 

50-40 

40-25 

25-14 

§2 - r = 0-97 
T - Qi = 0-49 



3- 


Q2 


Q2 


-2' 


2- 


-T 


T- 


-1-5' 


1-5 


-<3i 



Q2 = 2-67, 

r = 1-70, 

Qi = 1-21, 
(skew). 



Interquartile Eange = 1'46. 
Semi-interquartile range = 0-73. 
{Qi + T + Q^)/3 = 1-86 Paris Lines. 

Rough value for a, 0- 73/0-6745 = 1-08 Paris Lines. 



(3) Arithmetical Mean (Spearman's Formula) 



« 


P 


dp 


centre s' 


dpxs' 


dpxs'* 





■00 


•10 


•25 


•0250 


•0062 


0-5 


•10 


•04 


•75 


•0300 


•0225 


1 


■U 


■26 


125 


•3250 


•4063 


1-5 


•40 


•25 


1-75 


•4375 


•7666 


2 


•65 


■15 


25 


•3750 


•9375 


3 


•80 


•07 


35 


•2460 


•8575 


4 


•87 


•09 


45 


•4050 


18225 


6 


•96 


•04 


56 


•2200 


12100 


6 


100 


■00 












100 


Sums 


2^0625 


6^0281 



Threshold T = 2-0625 Paris Lines. 

Square of standard deviation = 6-0281 - T^ = 1-7742, 

Standard deviation = 1-33 Paris Lines*. 



* Without Sheppord's correction, for which see p. 84. 
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(4) Constant Process 
using Urban's Tables. (Appendix I, p. 194) 



s 


working s 


P 


W 


yW 


sW 


s'W 


syW 





-6 


-00 


-0000 


•0000 


•0000 


-0000 


•0000 


0-5 


-5 


-10 


-5376 


--4871 


-2^6878 


13-4388 


2-4356 


1 


-4 


-14 


-6463 


--4937 


-2-5853 


10-3413 


1-9749 


1-5 


-3 


-40 


■9768 


--1750 


-2-9306 


8-7916 


-5252 


2 


-2 


-65 


-9473 


-2581 


- 1-8945 


3-7890 


--5163 


3 





-80 


-7695 


-4579 


-0000 


-0000 


-0000 


4 


2 


-87 


-6215 


-4950 


1-2430 


2-4860 


•9900 


5 


4 


-96 


-3036 


-3759 


1-2146 


4-8582 


1^5036 


6 


6 


1-00 


-0000 


-0000 


-0000 


-0000 


•0000 






Sums 


4-8026 


1-5869 
- 1-1558 

•4311 


2-4576 
- 10-0982 

-7-6406 


43-7049 


7-4293 
--5163 
6-9130 



- 7-64 X 6-91 - -431 x 43-7 
4-80 X 6-91 + -431 x 7-64 
in working units from the working origin 
1-96 
2 
4-80 X 6-91 + -431 x 7-64 



1-96 



= 3- 



2*02 Paris Lines, 

= '241 in \^orking units, 



whence 



4-80 X 43-7 - 7-642 
CT = l/(-v/2^) = 2-93 working units 
= 1-46 Paris Lines. 

Titchener, using the Constant Process with Miiller weights alone 
(the above is with the Miiller-Urban weights), obtained 

T = 1-88 Paris Lines, 
h = 0-49, 
whence cr = 1'44 Paris Lines. 

Summarising in one table we have: 

Riecher's Data calculated by different Processes 
Process Threshold T Scatter it 

Limiting Process 1-99 0-97 



Linear Interpolation 


1-86 


1-08 


Arithmetical Mean 


2-06 


1-33 


Constant Process 


2-02 


146 



The Probable Error of the Thresholds calculated in these different ways. 
The values of the standard deviation given in the table immediately 
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above refer of course to the variation of the individual limina of which 
the threshold T is a central measure. If there are n of these individual 
limina, then in an ordinary way the arithmetical mean of these would 
have a standard deviation of a/-\/n. The standard deviations of the 
above values of T are indeed of this order of magnitude, say in the 

^''^^ "^^^ 0-97/^(100) = -097, 

but to avoid misconception ought to be regarded as distinctly larger 
than this. This arises from various causes which cannot here be gone 
into. In the case of the Limiting Process there is the dependence upon 
the particular choice of stimuli, in the case of Spearman's Arithmetical 
Mean formula there is the uncertainty about the centring of the 
"tails" of the distribution, etc. 

Professor F. M. Urban, some twelve years ago*, brought forward 
reasons which in his opinion showed that the Method of Limits was 
much more exact than the Constant Method: but his mathematics 
contained certain errorsf . The various processes do not differ very 
widely in this respect if each is used in circumstances favourable to 
itself, but on the whole the Constant Process is most reliable, and 
Linear Interpolation least. 

In conclusion, we may venture to express a cautious opinion on the 
choice of a process of calculation from among those given. Frequently 
of course there is no choice, for the conditions of experiment fix the 
matter for us. For example, if the just perceptible points have been 
recorded but not the frequencies p of answers of a certain kind at each 
stimulus-value, then the direct Limiting Process must be employed. 
We will suppose however that the fullest records have been taken. 

If the points at which p = and p = 1, that is the points where only 
one kind of answer is given, are known or are very nearly approached, 
the Arithmetical Mean of Spearman is in our opinion best. 

If however, as is frequently the case, these points are unknown, then 
the simple linear interpolation for the 25 %, 50 %, and 75 % points is 
a good plan from the point of view of simplicity and is often of sufficient 
accuracy. 

If the accuracy of the data justifies the use of the full Constant 
Process, then Urban's Tables lighten the work enormously, and give 

* "Die psychophysischen Massmethoden als Grundlagen empirischer Messungen," 
Archivf. d. ges. Psychol. 1909, iv. and xvi. pp. 261 — 415. 

t G. H. Thomson, "Note on the Probable Error of Urban's Formula for the Method 
of Just Perceptible Differences," Brit. Journ. Psychol. 1913, vi. p. 217 and "The Accuracy 
of the Phi-gamma Process," ibid. 1914, vii. p. 44. 
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perfect accuracy if exactly 100 experiments have been made at each 
stimulus. The great advantage of the Constant Process lies in the fact 
that the "tail" difficulty does not arise. Experiments in which it is on 
psychological grounds inadvisable to employ extreme stimuh can only 
be handled by this process. 

But it is not worth while applying it unless the calculator is assured 
that the experimental accuracy justifies it, and unless he has convinced 
himseK, by methods to be described in the next chapter, that the 
distribution is not significantly skew, and the data not heterogeneous. 
Very few collections of psychophysical data are worth the accuracy of 
the Constant Process, which is however undoubtedly the best theoreti- 
cally for symmetrical distributions. 

(5) DIFFERENCE THRESHOLDS AND THE PROBABILITY OF A 
JUDGMENT OF A CERTAIN CATEGORY 

A question closely bound up with the mathematics of the psycho- 
physical methods is that of the best measure of a subject's sensitivity 
to differences of stimulus-value. 

To fix ideas, we shall use the case already discussed of the difference 
threshold for lifted weights. When a sufficient number of judgments 
has been collected, the three categories lighter, equal or undecided, and 
heavier, are found to occur with varying frequency with the different 
comparison weights. The difference threshold is then decided by the 
positions of the points T and T' where the descending lighter and 
ascending heavier curves cross the halfway hue (see Fig. 7, p. 59). The 
distance {T — T')j2 or some closely similar quantity is what is called the 
difference threshold, and is commonly used in comparing the sensitivity 
of different subjects. The smaller T — T', the more sensitive the subject 
is said to be. 

This distance however depends entirely on the subject's readiness 
to give the answer undecided. It measures therefore rather a moral 
characteristic than a physical sensitivity, and varies very much with 
the instructions given to the subject. The moral character of the 
measure T — T' is above all seen from the fact that any subject who 
wishes may reduce it to zero, whatever may be his actual sensitivity to 
differences of weight, simply by determining that he will never give the 
answer undecided. 

There is however another measure which has been used. This can 
be most conveniently described by considering first a case in which a 
subject gives no undecided answers. In such a case, the thresholds TandT' 
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have come together and on the previous plan the subject's sensitivity 
would be considered as infinite, and all subjects giving no undecided 
answers would have the same infinite sensitivity: whereas clearly the 
subject's sensitivity is connected with the rapidity with which the 
curves pass from to 1 or vice versa, and two subjects may differ very 
much in this respect even although they both give no undecided answers. 
Under these circumstances a measure which has been used is the distance 
Q — Q', the horizontal distance between the crossing of the -25 and '75 
lines (see Fig. 7, p. 59). Under another guise it was used by Fechner also 
for the cases where undecided answers were given. In such cases he 
reduced the three curves to two by sharing the undecided answers 
between heavier and lighter. 

This measure has' the advantage that the subject cannot increase 
his apparent sensitivity at will, as was the case with the "threshold" 
measure. Q — Q' is the interquartile range of the point of subjective 
equaUty, represented by the crossing of the heavier and lighter curves. 
It and the difference threshold measure distinctly different things, 
and subjects placed in order of merit by the one will be found in a 
different order by the other. 

The points here raised seem to suggest an extension of Urban's idea 
of the probability of a judgment, which compares the giving of the 
judgments, heavier, undecided or lighter, with drawing a ball from an 
urn containing say red, white, and blue balls, and ascertaining its colour. 
For each stimulus the urn is supposed to contain different porportions 
of the coloured balls. 

In place of this is suggested the following. For each stimulus 
imagine an urn containing an infinite number of balls some black and 
some white, in a proportion varying in some way with the stimulus. 
A judgment may then be compared with taking not one but a handful 
of balls from the urn, the kind of judgment depending upon the pro- 
portion of black balls in the handful. 

From this point of view, the standard weight, the variable weight, 
and the physiological make up of the subject decide the proportion of 
black balls in the urn: but the decision as to what proportion is to be 
called heavier, what undecided, and what lighter, depends upon a con- 
scious act of the subject. 



CHAPTER IV 

SKEWNESS AND HETEROGENEITY IN PSYCHOPHYSICAL DATA 

Obvious skewness of many psychophysical curves — Pearson's test for goodness of 
fit applied to the method of average error — ^Applied to the method of right and 
wrong cases — Skew curves in homogeneous material — The summation method of 
finding moments — Calculation of a skew curve — Analysis into two normal curves — 
Conclusions. 

(1) OBVIOUS SKEWNESS OF MANY PSYCHOPHYSICAL CURVES 
To anyone accustomed to handling distribution data, a most striking 
point about the results of many psychophysical experiments is the 
obvious skewness of much of the data. Both the examples used ex- 
tensively in the previous chapter show this very strongly as can be seen 
from an inspection of the data either in numerical or diagrammatic 
form, and also from the various values of the threshold T found by the 
different processes of calculation : for these differences arise largely from 
the fact that the distribution is not normal. 

Taking the best of the processes, the Constant Process, it is of 
interest to see how closely the curve which it gives fits the original data. 
A method of thus estimating the goodness of fit of curves has been given 
by Professor Karl Pearson. His method is perfectly general, and 
applicable to all classes of curves*, but it has been most fully worked 
out for the fitting of bell-curves to histograms. Our problem is not of 
this nature, though it might appear to be so, for the pseudo-histogram 
(Fig. 8) which can be formed from the frequencies f differs essentially 
from a real histogram. Since in psychophysics it may often be necessary 
to fit curves to real histograms, for example those obtained in the 
Method of Average Error, we shall first explain Pearson's Goodness of 
Fit Test for this case, using the bisection data of Chapter II for the 
purpose (see pp. 15 and 42 and Fig. 6). 

(2) PEARSON'S TEST FOR GOODNESS OF FIT APPLIED TO THE 

METHOD OF AVERAGE ERROR 

Thebisection data had a mean of 60'13 mms. and a standard deviation 

of 1-38 mms. With these values, using Sheppard's Tables, we draw the 

smooth curve shown in Fig. 6. Now it is important at the outset to 

* Phil. Mag. July 1900, Fifth series, L. pp. 157 — 175. PUl. Mag. April 1916, Sixth 

series, xxxi. pp. 369—378. 
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realise that whether that curve is a good or bad fit to the data depends 
on the number of observations made. The number in this case was 
only 29, and it will presently be shown that the curve is a very good fit. 
But had the number of observations been 2900 it would have been a 
bad fit, for with such a number of observations the histogram ought to 
have modelled itself more closely to the curve. 

In order to apply Pearson's test, we must find the theoretical histo- 
gram for comparison with the observed histogram. That is, we must 
find the areas of the slabs of the curve in Fig. 6 which replace the 
rectangles. (One of these slabs is cross-hatched in that figure to explain 
more clearly what is here meant.) This is most easily done from 
Sheppard's Tables by calculating the areas of the smooth curve from 
— 00 up to each dividing ordinate in turn, and taking the difEerences 
of these numbers, as is done in the following table. The quantity 
1(1 + a) in Sheppard's Tables is the area of a Normal Curve, of unit 
total area, up to the ordinate xja. For negative cc's it has to be subtracted 
from unity. 



Calculation of the Theoretical for comparison with the Observed 
Histogram of the Bisection Data 



z mms. 


a;' =a: -60-13 


Sheppard'a 
a;=a;71-38* 


Sheppard'a 


MultipUed 
by 29* 


Difieiences 












009 


63-95 


3-82 


2-77 


■997 


28-91 


0-52 


62-95 


2-82 


2-04 


•979 


28-39 


2-09 


61-95 


1-82 


1-32 


-907 


26-30 


5-36 


60-95 


0-82 


0-59 


•722 


20-94 


7-95 


59-95 


-0-18 


-0-13 


•448 


12-99 


7-33 


58-95 


-1-18 


-0-86 


-195 


5-66 


4-01 


57-95 


-2-18 


-1-58 


-057 


1-65 


1-36 


56-95 


-318 


-2-31 


-010 


0^29 


0-29 




29-00 



The actual and theoretical histograms are then compared in the next 

table. Pearson's method, the theory of which cannot be given here, 

then forms the quantity 

2 _ a /square of differences of theoretical and observed frequenciesX 

\ theoretical frequency / " 

With this value x^, and n' the number of cells in the histogram. 

Table XII in Pearson's Tables is then entered and a value of P found. 

* These can be calculated at one opening of Crelle'e Tables, or by one setting of a 
slide rule. 
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n' is here 9 counting in the two tail cells. P is then the probability that 
the observed or a worse distribution will be obtained (assuming the 
theoretical distribution) in a sample of the size taken, here 29. 

Calculation of Goodness of Fit of Normal Curve to the Bisection Data 



mms. 


Actual observa- 
tions m 


Theoietical m' 


(m - m'YIm' 


Above 63-9 
63—63-9 
62—62-9 
61—61-9 
60—60-9 
59—59-9 
58—58-9 
57—57-9 

Below 57 



1 

2 
6 
6 
8 
5 
1 



0-09 
0-52 
2-09 
5-36 
7-95 
7-33 
4-01 
1-36 
0-29 


•090 
-443 
•004 
-076 
•478 
•061 
•244 
•095 
•290 




29 


2900 


b781=x' 



No. of cells n' = 9. 

From Elderton's Tables, No. XII in Pearson's Tables, 

P = 0-998 for x^=l, 
0-981 for x^ = 2, 
therefore P = 0*99 approximately. 

In our case we find P = 0^99, i.e. 99 samples of this normal distri- 
bution out of 100 would give no better a fit than the present. The 
Normal Curve is therefore a perfectly satisfactory theory for these 
29 observations. 

(3) PEARSON'S TEST OF GOODNESS OP FIT APPLIED TO THE 
METHOD OF RIGHT AND WRONG CASES 

The above plan of testing goodness of fit cannot however be applied 
to the pseudo-histogram of Fig. 8*. The reasons why this is so cannot 
be here gone into in detail, but they are based upon the following 
differences between the two cases. In a real histogram, if any one of 
the cells is larger than it ought to be, then any other must have a 
tendency to be smaller than it ought to be. There is a strong negative 
correlation between the numbers in the cells, a correlation, that is, from 
trial to trial. In the psychometric pseudo-histogram, however, formed 
from the proportions p, this is otherwise, because the p's are measured 
quite separately from one another. In a real histogram the numbers in 

* G. H. Thomson, "The Criterion of Goodness of Fit of Psychophysical Curves," 
Biometrika, 1919, xn. pp. 216—230, 
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each cell are necessarily positive quantities. In the psychometric 
pseudo-histogram they may be negative, if the p'a do not rise steadily. 
Psychometrical data of the kind here considered, in fact, as has 
already been pointed out, are not really in histogram form. Although 
a kind of histogram can be deduced from them, it is only by making 
certain assumptions, and the intercorrelations of the cells of this arti- 
ficial histogram are different from the intercorrelations of a naturally 
observed histogram. Under these circumstances we must in applying 
the goodness of fit test turn to the directly observed quantities p. These 
are compared with the values calculated from the Constant Process in 
the following table, the remaining columns of which are explained below: 



Calculation of Goodness of Fit of a Constant Process Ogive 

Urban'a Subject I, answers heavier 



Grams < 


Observed p 


Calculated p' 


p-p' 


(p-pr/pV 


84 


•0022 


•0013 


•0009 


■001 




88 


•0200 


•0122 


•0078 . 


■005 




92 


•0889 


•0697 


■0192 


■006 




96 


•2222 


•2394 


- ^0172 


■002 




100 


•4133 


■5246 


-•1113 


•050 




104 


•8956 


•7972 


•0984 


•060 




108 


•9400 


■9454 


-•0054 


•001 






■125=S{{p- 


-p'YIp'^} 



X^ = 450 X ^125 = 56-25, 

w' = 8 (one more than the number of stimuli), 

P = -0000005 from Pearson's Tables. 

To these quantities f the principles underlying Pearson's test can 
be applied direct. They are indeed the same principles already used in 
Chapter II when we were discussing curve fitting. We have n quantities 
p which are independently measured, and n quantities f' which are 
theoretically given. The variations of f from f' are binomial in form, 
that is approximately normal. If we look upon the judgment heavier, 
as suggested in an earlier paragraph, as being comparable with drawing 
black balls out of a bag containing black balls and white balls in the 
proportions f' and 1 — f', then the probable error of f is 

•6745V{p'(l-p')/A}. 
Ic being the number of judgments of which ph are of the category heavier. 
For the chances of obtaining 0, 1, 2, ... , k— 1, or h black balls in 
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a drawing of k are given by the terms of the binomial (p' + q'f, q' being 
1 — ?'; that is, the chances of obtaining 

TP = Ojh, 1/k, ...{k- Vjjk, k/k or unity. 
The standard deviation of the above binomial is V(^'?')) ^^^ the 
standard deviation of p therefore wViWq'h or Vip'i'l^)- The proba- 
bility of an error p — p' is therefore 



e 



V(^p'q') 

The probability of the whole set of observed values p^, f^, ... p„ 
occurring is the product of n such factors, and is of the form 

where ;^2 = ^ |^ (£z;^'| , 

or if k is the same at each stimulus, 

The remaining columns in the above table calculate this quantity*, 
which is the same as Pearson's x^, under our special circumstances. 
For n' we have to use n + \, because all the n values of p can vary 
separately, whereas in a real histogram the number of variables is one 
less than the number of cells. We thus reach the value P = -0000005, 
that is, the curve is an incredibly bad fit to the data, which cannot 
possibly be regarded as differing from a normal distribution by sampling 
errors alone. 

(4) SKEW CURVES IN HOMOGENEOUS MATERIAL 

There are two immediate hypotheses which present themselves to 
explain this bad fit of the normal curve, (a) that the material is not 
homogeneous, the conditions of experiment not having remained the 
same throughout, (b) that the material is homogeneous, but the under- 
lying factors which cause the distribution of errors or deviations are 
not independent, but correlated, as in the system of generalised proba- 
biUty curves next to be described. A word of caution may first be given, 
for, as we have already suggested, the identification, or even the 
approximation, of graphical representations of "percentage" judgments 
to true frequency-distributions is of extremely doubtful validity. 

* A table in Appendix I, giving values of l/(pg), lightens the work considerably. 
B. & T. 6 
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The general theory of curve fitting has been worked out in great 
detail by Professor Karl Pearson*. A good account of his method is 
given in a book by W. Palin Elderton, Frequency Curves and. Correlation, 
C. and E. Layton, London, 1906, to which the mathematical reader is 
referred for fairly complete theoretical and practical information. 

Frequency curves of data not involving a mixture of species tend 
to commence at zero, rise to a maximum, and then fall either at the 
same or at a different rate. There is often high contact at one or both 
ends of the distribution. An equation of the general form 

dy ^ {x + a)y 
dx f{x) 
satisfies both these conditions, since if y = 0, dyjdx = (high contact), 
and if x= — a, dyjdx = (maximum; for a maximum, again, the 
second differential coefficient must be negative). Expanding / (a;) by 
Maclaurin's Theorem, we have 

^_ {x + a)y 

dx Co + c^a; + c^^ + c^ + ...' 

(1) Putting Ci = Cg = Cg = ... =0, we have 

\ dy _x -\- a 

ydx Co""' 
which is the Gaussian or normal curvef. It fits the symmetrical binomial 
(I + 4)"> ®-g- ill •^oin tossing, where the chances for and against are 
equal (p = q), and the contributory causes are independent of one 
another. 

(2) Putting Cg = Cg = ... = 0, we have 

\ dy _ X + a 

y dx Cq + c^' 
which represents a class of curves varying from the Gaussian curve to 
the J-curve. It fits the asymmetrical or "point" binomial (p + qY, e.g. 
in teetotum spinning or dice throwing, where the chances for and 
against are not equal, 'pi=q, but the contributory causes are still 
independent of one another. 

» Karl Pearson, "Skew Variation in Homogeneous Material," Phil. Trans. 1895, 
CLXXxn. A, pp. 343 £E. ; " On the Systematic Fitting of Curves to Observations and Measure- 
ments," BiomelTika, I. pp. 265 ff. and n. pp. 1 ff., 1901 — 3; "On the Curves which are 
most suitable for describing the frequency of Random Samples of a Population," 
Biometrika, 1906, v. pp. 172 — 5 (an exceedingly clear summary of the principles involved). 
Also later papers in Phil Trans. 1901, oxovn. A, pp. 443 — 459, and 1916, being supplements 
to the first mentioned memoir. See also pp. Ix to Ixx in Pearson's Tables for Statisticians 
and Biometridans, Cambridge, 1914. 

t Compare equation (5), Chap. 11, p. 34. 
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(3) Putting 03 = 64 = 



Idy 
y dx 



0, we have 

X + a 



^0 ~^ ^V^ "^ ^2"^ 

which can be made to represent almost all the frequency distributions 
which may arise. It fits the hypergeometrical series, the successive 
terms of which, e.g. give the chances of getting k,Jc — I, ... black balls 
from a bag containing pn black and qn white balls when k balls are 
drawn*. 

Here the contributory causes are not independent of one another. 

There is no advantage in employing equations which involve Cg and 
higher constants, because their use necessitates the calculation of the 
6th and higher "moments," and these have very high probable errors. 

Definition. The nth moment coefficient (ij,„') of any distribution about 
any ordinate is the sum of the products of the partial frequencies and 



A 


Centroid 
vertical 




j\ 


^ 


1 V 



t 



axis of X 



Fig. 10 



the wth power of the distances of these frequencies from the ordinate, 
divided by the total frequency. In symbols, if iV be the total frequency, 



iV/i„' = I oS^yhx. 



The moments are, in practice, first calctzlated about any arbitrary 
ordinate that is most convenient, and then reduced to moments j,bout 
the centroid in the following way (dashed /I's represent moments about 
an arbitrary ordinate, undashed /a's about the central ordinate) : 

xfyU = Nfin' - nxNii'„_^ + ^-^^ ^NiJi'„_^ - 



This gives the general reduction formula 



1.2 



* The aeries is 
pn,{pn-l) ... {pn - k + 1) L kgn 



fe(fc-l) 



qn {qn - 1) 



pn-h + l 2! ' (pn-k + l){pn-k 



T2)+-}- 



Other aeries may arise. 



6—2 



84 PSYCHOPHYSICS [ft. i 

which enables us to transfer any moment from an arbitrary ordinate 
to the mean. Thus we have: 

t^i = H-i - Wl^' + ^fh.'W - 3^'* 
The symbols fi represent moments of the curve; but we have to 
start with grouped frequencies, where the frequencies are assumed to 
be concentrated along the mid-ordinates of the rectangles (cf. Fig. 105). 
The moments obtained from these grouped frequencies are denoted by 
v's (dashed and undashed), and corrections are necessary. These have 
been deduced by Sheppard* and are consequently known as Sheppard's 
adjustments. They are: 

M2 = "2 - A' 

Ml = "4 - i''2 + ^■ 

It is generally said that they are only valid when there is "high 
contact" at the ends of the frequencies, but the equations for /aj and fi^ 
are probably still vaUd even without high contact, if the terminal 
frequencies are zero. 

(N.B. In working, v"a are changed into i/'s before applying Sheppard's 
corrections.) 

It is obvious that vq = 1 and v^ = 0. v^' is the distance of the mean 
or centroid vertical from the arbitrary ordinate about which the moments 
are first taken, and is conveniently known as d. 

Two very important constants in curve fitting are 

^, = ^;,and^, = -^. 

/*2 ^2 

The values of these are always to be calculated, and, within the limits 
of their probable errors, they fix the type to which the curve belongs. 
The general frequency curve equation, written in terms of moments, is 

^,W^i (^2 + 3) 

Idy^ ^ 2 5;6a-6^i-9 

ydx ( 4;8,-3^i V^, (;8, + 3) x 2^, - Sp, - 6 /x\Y 

Il0i32-12i3i-18^ 2 5^2-6^j^-9'al0^^-12^i-18\a)\ 

* W. F. Sheppstrd, "On the Calculation of the moat Probable Values of Frequency 
Constants, for Data arranged according to Equidistant Divisions of a Scale," Proc. Land. 
Math. Soc. xxix. pp. 363 S. Karl Pearson, " On an Elementary Proof of Sheppard's 
Formulae for correcting Baw Moments and on other allied Points," Biometrika, 1904, m. 
pp. 308 ff. 
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This gives at once, for the distance between the mean and the mode, 

2 (5)82 -6)81 -9) 
(origin is at mean). 

Hence the curve is symmetrical if /S^ = 0. If )8i = and ^^ — ^> *^® 
curve reduces to the Gaussian or Normal Curve, since the terms in- 
volving X in the denominator of the right-hand side of the general 
frequency curve equation then vanish. 

In using the Pearsonian method, then, the order of procedure to be 
adopted is: 

(1) Calculate the moment coefficients v^', v^', v^', v^ about a con- 
venient arbitrary ordinate, 

(2) Transfer to the mean by the equations 

"2 = "2' - "i'^ 

»'3 = »'3'-3>'iVij'-|-2vi'3, 

{y^ or d is the distance of the mean from the arbitrary ordinate.) 

(3) Determine the corresponding moments for the curve by the 
equations 

/^ = »'2 - A ] 

jiig = V3 > Sheppard's corrections. 

/*4 = "4 - ¥i + ^J 
(N.B. For these corrections to be applicable, two conditions must be 
fulfilled: 

(i) there must be high contact, 

(ii) the grouping of the frequencies must be eqtial.) 

(4) Calculate yS^ and p^ ^J t^^ equations 

These results give the distance of the mean from the arbitrary ordinate 
{v-^' or d); the staTidard deviation (V/^); ^^^ t^i^ mode 

oVM^ + 3) 
°''^°"2(5;8,-6i3i-9)- 

The median is more difficult to determine exactly, but a position 
which is approximate and indeed very accurate in all the curves we are 
likely to meet with in this work, is between the mean and the mode, but 



86 



PSYCHOPHYSICS 



[PT. I 



nearer to the mean, so that the distance from the mode is twice the 
distance from the mean*- 

Mean Median Mode 



1/3 2/3 

The investigator may then proceed to determine to which of the 
Pearsonian "types" the particular curve belongs, to find its equation, 
and to plot it. The type is decided by the constants ^j and jSg (using 
Diagram XXXV, p. 66, Pearson's Tables) or by the criterion Kg where 

"^ 4(4j82-3i3i)(2^2-3i3i-6)' 
using the following diagram: 



Typel 




Type V 



Type VI 



The equations to the types are given in detail in Pearson's Tables, 
p. Lsdii, or in Elderton (op. dt.), where valuable advice on the arrange- 
ment of the calculations will be found. Elderton's Type II includes 

* Karl Pearson, Biometrika, i, 1902 — 3, p. 265; Phil. Trans. OLxxxvi. A, p. 376; Boy. 
Soc. Proc. Lxvin. p. 369b. C. V. L. Chariier, "Researches into the Theory of Probability," 
Liunda Universitets Arsakrift, 1905 (1), 1, equation 9. Arthur T. Doodson, "Eelation of the 
Mode, Median, and Mean in Frequency Curves," Biometrika, 1917, xi. p. 425. 
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both our Types II and VII, and his Type VII is our G, the Normal 
Curve. 

We shall confine ourselves to a fuller description of Type IV, which 
appears to be the type most common in psychometric skewness*. 

It will be clear from the above account that the whole of the calcula- 
tions are based upon the first four moments of the data, and we proceed 
first to describe the most convenient way of finding these when as here 
the stimuU are equidistant, viz. the summation method already used 
for finding the mean on p. 19. If the stimuli are not equidistant the 
calculations are rather longer. 

(5) THE SUMMATION METHOD OF FINDING MOMENTS 
IN THE CASE OP DATA AT EQUIDISTANT POINTS 

A full description of this device will be found in Mr Palin Elderton's 
book already cited, where it is attributed to Mr G. F. Hardy. It has 
however been independently used by numerous writers, e.g. Lipps, 
Wirth, Urban, etc.f 

The theory cannot be worked out here, but the reader can easily do 
so for himself. It is only a question of simple algebra; or from another 
point of view it is the same thing as integration by parts. We give only 
a worked example, Urban's Subject I, heavier answers. 



Example of the Use of the Summation Method 



Grama 


V 


series 


of successive sums 


84 


•0022 


•0022 


-0022 


•0022 


88 


•0200 


•0222 


-0244 


-0266 


92 


■0889 


•1111 


•1355 


-1621 


96 


•2222 


■3333 


■4688 


-6309 


100 


•4133 


•7466 


1-2154 


1-8463 


104 


•8956 


1-6422 


2-8576 


4-7039 


108 


■9400 


2-5822 


5-4398 


10-1437 




2^5822 


64398 


10-1437 


17-5157 




S^ord 


Ss 


s. 


s. 



* Thirteen out of fifteen psychometric curves tried recently were Type IV. Cf. Pearson 
on zoological and anthropological curves, where Type IV also prevails, Phil. Trans. 1896, 
OLXXXVI. A, Part I, pp. 388, 403 and 411. 

t 6. F. Lipps, "Die Theorie der Collectivgegenstande," Wundt's Phil. Stud. Bd. rvn. 
Separat-Abdruck, Leipzig, 1902; W. Wirth, "Die mathematischen Grundlagen der soge- 
nannten unmittelbaren Behandlung psychophysischer Resultate," Wundt's Psychol. 
Studien, 1910, Bd. vi. pp. 141, 252, 430. Urban on Wirth, Archiv f. d. ges. Psychol, xx. 
Literaturbericht, p. 1. 



88 PSYCHOPHYSICS [pt. i 

The table is self-explanatory. Each column is formed from the 
preceding one by successive summations (from the top in this case), 
and is then totalled*. The origin is here the centre of the group beyond 
108 grams, viz. 110 grams, and the unit of measurement is 4 grams, 
measured downwards from 110 grams. We have 

Mean = 110 - 4rf = 99-6712 grams. 

Further, it can be shown that the moments 

i/a = 2Ss-d{l + d), 

vs = 6S4 - 3v2 {l + d)-d{l + d){2 + d), 

Vi^2iS^-2vs{2{l + d)+l} 

- V2 {6 {l + d){2 + d)-l}-d{I + d) (2 + d){3 + d). 

The work is not heavy up to this point if arranged systematically, 
and in the present case it gives 

1/2 = 1-6296, 
V3 = 0-9645, 
vt = 9-1621, 

or, using Sheppard's corrections, 

/LI2 = 1-5463, a = y/tH = 1*2435, 

JLI3 = 0-9645, 
Hi = 8-3765. 

This a gives in original units 4 x 1-2435 = 4-974 grams, differing from 
the value mentioned on p. 63 because of the Sheppard adjustment. 
From the moments we obtain 

Pi = 0-251, 
P2 = 3-51, 
^2 = 0-775. 

The type is therefore Type IV; within the limits of probable error 
of )8i, jSg, and /cg it might however be Type G, VII, or V. Unfortunately 
the ordinary methods of finding these probable errors are of doubtful 
significance in the case of pseudo-histogram data such as ours. We turn 
to the calculation of Type IV. 

* Slightly greater accuracy could, by the way, be attained by using the actual numfaeia 
of answers heavier and not the proportions p in cases like the present where an awkward 
number. of experiments was performed at each stimulus, viz. 450, leading to recurring 
decimals. The totals would then be divided by this number. 
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(6) CALCULATION OF A SKEW CURVE (Type IV) 

The equation* is ^ 

(^2\ —m —V tan—' - 

wherein m = | (»• + 2) say, 

where r = 6 (jS^ - /S^ - Y)^^ - 3^^ - 6), 

r(r-2)V)8, 



V{16(r-l)-i8i(r- 

Sk (skewness) = i/(4m). 
Origin, at mean + va/r; 
mode, at mean — skewness x a; 

oos^ <f> I 
12r' 



m 



-— rljn say, 



N ^r e 



3j- 



<j>v 



tan <l> ■■ 



■^" a V(27r) (cos ^y+^ 
The actual form of the calculation depends on the appliances avail- 
able, whether Crelle's Tables, or logarithms, or calculating machines. 
Using Crelle and seven figure logarithms we get finally, after much 
labour, the following values for the ogive, which has to be obtained 
from the bell-curve by simple quadrature: 

Vrban's Subject I, HeaTder answers 



Stimulus 


Observed p 


Calculated hy 
Type IV 


ip-p'mpY) 


84 

88 

92 

96 

100 

104 

108 


•0022 
•0200 
•0889 
•2222 
•4133 
•8956 
•9400 


•0046 
•0184 
•0690 
•2155 
•5006 
•8078 
•9624 


•0012 
•0001 
•0062 
•0002 
•0305 
•0496 
•0139 






Sum •lOl? 



X^ = 450 X -1017 = 45-8, P < -0000005 still. 

We find therefore that fitting the best Pearson curve possible to the 

data makes practically no improvement in the fit, which is still so very 

bad as to make it quite certain that the data are not homogeneous at 

all. The fit cannot be much improved by other assumptions as to the 

spread of the "tail," several of which have been tried. The Type IV 

* The notation on this page is that of Elderton following Pearson, and the symbols 
m, •/, n and r have no connection with these symbols used elsewhere in the present book. 
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curve itself is shown, contrasted with the pseudo-histogram, in Fig. 12. 
The mean (99-67), median (99'97) and mode (100'52) are worth com- 
paring, as a matter of interest, with the thresholds obtained otherwise 
(see pp. 53, 60, 63 and 70). 

The reason for the bad fit in this case is, mathematically, the im- 
possibility of finding a curve to accommodate both the tall rectangle 
217, and the tail of 27, however the latter may be allocated. Much of 
Urban's other data shows the same bad fit, and for the same reason, the 
size of the "tails." Not all however are bad. The best case is Subject II 




84 88 92 96 100 104 108 

Fig. 12. A Type IV curve fitted to Urban's data for Subject I, lieavier 
answers, in pseijdo-Iustogram form 

(Urban himself) who had had much practice at this form of eooperiment. 
The lighter answers in his case were as follows, compared (1) with a 
curve fitted by the Constant Process and (2) with a Pearson skew curve 
(here Type I). 

Urban's Subject II, Lighter answers 



Grams 


Observed p 


Normal Curve p 


Type I Curve p 


84 


■9333 


■9504 


•9432 


88 


•8622 


•8540 


•8520 


92 


•7000 


■6767 


•6875 


96 


•4489 


•4456 


•4627 


100 


•2311 


•2320 


•2379 


104 


•0956 


•0922 


■0858 


108 


•0166 


•0272 


■0187 



On testing the goodness of fit we obtain 

For Normal Curve, P = 0-48. 
For Type I Cunre, P = 0-91. 
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Here the Gaussian is a good, and Type I an excellent fit. There is every 
reason then to think that the data here are homogeneous. The bell- 
curve and pseudo-histogram are shown in Fig. 13. 

Since we have decided that the data of Urban's Subject I are hetero- 
geneous, that is, that the conditions of the experiment varied consider- 
ably during its performance (which lasted several months), the question 
naturally arises as to whether we can analyse the data mathematically 
into two or more frequency-distributions. This question was discussed by 
Professor Pearson in 1894* as far as an analysis into two normal curves 
goes. One more moment, fj^, is needed and as the probable error of this is 
considerable the practical application of the plan to be described is not 
very satisfactory. 

h200 



-100 



Tail =30^ 





/ 


113 


-^«^ 


s. 






98 






73 




/ 


s 

61 




'^ — 

32 


36^ 



JTail =7 



80 84 88 92 96 100 104 108 

Mg. 13. A Type I curve. Urban's Subject TI, lighter answers 

(7) ANALYSIS INTO TWO NORMAL CURVES 
StcK/e I. Find the centroid of the frequency curve and calculate 
^. /*3. /t4. M6> K and Ag. ^^ ^ q^^% _ 3^^^ 

A5 = 30jti2/i3 — S/ig. 

Stage II. Solve the following nonic equation for pg using Sturm's 
functions to localise roots : 

24^2* - 28A4P2' + 36j.^V - (^if^B - 10A4^) p^ 

- {U8iJL,n, + 2A52) Pa* + (2881.^3* - 12A,A5;^ - A/) p^^ 
+ (24^2As - 7/.3%'') p,^ + 32)L^*A4p, - 2if^^ = 0, 

and find pi from 

2/n2» - 2fi^2 - ^iPi - ^IhPi 
P^^- W-KP2 + ^P2' 

* Phil. Trans. Boy. Soc. London, 1894. 
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Stage III. Find y^ and y^ the roots of 

/ - piy + P2 = 0- 
Ayi and Ayg are the positions of the axes of the normal component curves, 
where h is the unit of length. 

Stage IV. The fractions z^ and Zj, that the areas of the component 
curves are of the area of the whole curve, form the roots of the quadratic 



z^ — z — 



= 0. 



Pi - 4^3 
Stage V. The standard deviations are found from 

f^iW = /^ - Wy2 - ^iVi + Pa. 
^iW = /*2 - WVi - hVxYi + Pa- 




so 84 88 92 96 100 104 108 112 grams 

Kg. 14. FBeudo-histogram analysed into two normal curves, with their sum. 
Urban's Subject I, heavier answers (fit still very poor) 

The above refers to asymmetrical frequency curves. For symmetrical 
curves (where the components have the same axis) modifications are 
required. 

On carrying out this very troublesome calculation with the hetero- 
geneous data of Urban's Subject I, heavier answers, the nonic was found 
to have three possibly usable roots: but two of these led to imaginary 
quantities, leaving only a root which led to the following analysis, 
approximately: 



First component 


Second component 


Centre 98 grama 


101-7 grams 


a 6-73 „ 


3-54 „ 


Area 250 


200 
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The two curves are drawn on the adjoining figure. On adding them 
and finding their areas between the appropriate points (84, 88 grams, 
etc.) we arrive at the following comparison : 





Urban' s Subject I, Heavier answers 


Grams 


p observed 


p' of Compound 


{p-p'rupV) 


84 


•0022 


•0039 


•001 


88 


•0200 


•0228 


•000 


92 


•0889 


•0836 


•000 


96 


•2222 


•2262 


•000 


100 


•4133 


•4938 


•026 


104 


•8966 


•8031 


•054 


108 


•9400 


•9603 


•016 




•097= sum 



X»=450x •097=4365. 

The decrease in the value of x^ obtained by dissecting into two 
normal curves is therefore only very slight indeed. P is still only about 
■000001 in value. The heterogeneity is more complex than can be thus 
dealt with. 

(8) CONCLUSIONS 

Further analysis is useless. For since there are only seven points 
given by experiment in the curve, and also the total area, it is clear 
that an exact fit could be obtained by two skew curves, or a skew and 
a normal, which have between them eight constants; or in many ways 
by three normal curves. 

All we can say is that this subject's sensitivity oscillated between 
at least two states; and if only two, then one of these is auch as to give 
a skew curve. Without attempting to get an exact fit, it is clear after 
our experience, that provided we supply a flat normal curve to give 
the awkward tails, the surplus of the distribution could be fitted success- 
fully by a skew curve. 

This suggests that the two components into which the heterogeneous 
data are thus divided are (a) a component due to erratic answers giving 
a wide shallow distribution and (6) a component really corresponding 
to the conditions of the experiment. 

The use of "catch" tests in threshold determination is to check 
component (a). The proper way to employ them is to cancel sittinga 
at which numerous catch errors occur, the subject being then pre- 
sumably in an abnormal state. 

In Riecker's experiment many "catch errors" were present, i.e. 
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occasions when the subject answered two to a one-point stimulus. Fitting 
the best normal curve to his data, we only get, on testing its goodness 
of fit, P = -0003. 

Riecker's Data 

Mean 2-02, (t :=1 -46 Paria Lines 



Paris 
Lines s 


Observed p 


s-2-02 


«-2-02 
1-46 


Gaussian p' 


p-p' 


(P-pyipr 





•00 


-2^02 


-1-38 


-084 


-■084 


•092 


0-5 


•10 


-1^52 


-1-04 


•149 


-•049 


•019 


1 


•14 


-l^02 


-0-70 


•242 


-102 


•057 


1-5 


•40 


-0-52 


-0-36 


■359 


■041 


■007 


2 


•65 


-0-02 


-0014 


•494 


•156 


■097 


3 


•80 


0-98 


0-67 


•749 


•051 


■014 


4 


■87 


1^98 


1-36 


•913 


-•043 


.023 


5 


•96 


2-98 


204 


•979 


-•019 


■018 


6 


1^00 


3-98 


2^73 


■997 


•003 


■003 




Sum ^330 



x' = 100x^330, »' = 10. P= 00031. 

Compare with this the following case. The data are for the spatial 
threshold on the forearm*, and were gathered by the Method of Non- 
Consecutive Groups. Sittings containing more than a certain number of 
catch errors were rejected, the number chosen being one which when 
exceeded at aU was usually exceeded violently. Other experimental 
precautions were taken which are described in the articles cited. As a 
result it is found that a roughly fitted Normal Curve is qtiite a fair fit, 
for in 13 out of 100 cases a worse departure would be got by chance: 
and a skew curve would improve this. The data are in fact reasonably 
homogeneous. (Calculations on opposite page.) 

The conclusion of the whole matter is that we are led to believe that 
the difficulties of psychophysical experiment are such that homogeneity 
in the data is rare. For such data refinements of mathematical calcula- 
tion are out of place. The curve fitting methods here described are 
however of value in discovering the heterogeneityf . 

With increasing precautions in carrying out the experiments and 
with increasing practice on the part of the subject, it would appear that 
the data finally reach a distribution where they are fairly well fitted by 
a Normal Curve, and excellently fitted by Pearsonian Skew Curves. 

* G. H. Thomson "A Comparison of Psychophysical Methods," Brit. Journ. Psychol. 
1912, V. pp. 203—241; "Changes in the Spatial Threshold at a Sitting," Brit. Journ. 
Psychol. 1914, VL pp. 432 — 448 and B.A. Report, 1913, pp. 681—683. 

t Compare Urban's use of the Lexian Coe£Soient of Dispeision (op. cit.) and compare 
the latter -with Pearson's more significant criteria Pi, p^i S'Ud the /S^, diagram (XXXV 
in his Tables). 
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Normal Integral fitted to Thomson's Spatial Threshold Data 


Cms. 


Two-point 
answers 


Continued 
sum 


Cms. fiom 
mean 


In (T 
units 


Theoretical 
p'from 
Sheppard 


Observed p 


(p-p')VpV 





6-2 


5-2 


-2-44 


-2-29 


•Oil 


•035 


•05293 


i 


6 


11-2 


-1-94 


-1-82 


•034 


•040 


•00109 


1 


8 


19-2 


-1-44 


-1-36 


•089 


•053 


•01599 


H 


21 


40-2 


-0-94 


-0-88 


•189 


•140 


•01565 


2 


56 


96-2 


-0-44 


-0-41 


•341 


•374 


•00485 


2i 


84 


180-2 


006 


006 


•524 


•560 


•00521 


3 


105 


285-2 


0-56 


0-52 


•698 


•700 


•00002 


3i 


125 


410-2 


106 


0-99 


•839 


•834 


•00018 


4 


141 


651-2 


1-56 


1-46 


•928 


•940 


•00216 


*i 


144 


695-2 


2-06 


1-93 


•973 


•960 


•00644 


5 


148 


843-2 


2-56 


2-40 


•992 


•987 


•00315 




I 5 


843-2 


3137-2 








Sum 


= •10767 


150 


























30 


168-64 


627-44 














5-621 


20-915 






d 


«, 













d in cms. is 2^81 from an origin of 6^ downwards. 

Mean=5^25-2^81= 2-44 cms. 

2S, =41^83 

d(l-t-d)= 37-20 

1-,= 463 

Sheppard's correction 0-08 

/t,= 4-5S=<r' 

ff=2133 
or 1^067 cms. 
X» = 150x^l0767 = 16^15, 
Ji' = 12, P=013. 



PART II 
CORRELATION 

CHAPTER V 

INTRODUCTION TO CORRELATION 

A SOMEWHAT detailed account of the mathematical theory of correlation 
and of the way in which it may be usefully applied to psychological 
measurements will be found in the later chapters of this Part. The object 
of the following introductory pages is to give the reader a general pre- 
liminary view of the method, free from mathematical complications, 
and to illustrate it by means of a simple example. 

Correlation may be briefly defined as "tendency towards concomitant 
variation," and a so-called correlation coefficient (or, again, correlation 
ratio) is simply a measure of such tendency, more or less adequate 
according to the circumstances of the case. J. S. Mill, in his "System 
of Logic," distinguished a special scientific "Method of Concomitant 
Variations," which he based upon the following principle: 

"Whatever phenomenon varies in any manner whenever another 
phenomenon varies in some particular manner, is either a cause or an 
efEect of that phenomenon, or is connected with it through some fact 
of causation*." 

The instances of this principle which Mill had in mind were mainly 
cases of approximately "complete" concomitance of variation, such as 
those usually met with in the domain of Physics. In such cases, the 
conditions of an experiment admit of a high degree of simplification, 
the phenomenon, or series of phenomena, under investigation can be 
isolated with tolerably complete success, and the " irrelevant" factors can 
be reduced to a minimum. Under such conditions, when the degree of 
concomitance of the difEerent corresponding measures of the two pheno- 
mena is found to be very high, the slight deviations from complete 

* Logic, Bk. in. Ch. viii. § 6. 
B. & T. 7 
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correspondence are put down to "errors of observation" or other un- 
avoidable imperfections in the experimental method employed. 

If the correspondence is one of simple proportionality, so that the 
graphical representation of it (one phenomenon being measured along 
the axis of x, the other along the axis of y) is a straight line*, the corre- 
lation coefficient r will be unity. Example: the variation of the length 
of a metal rod with temperature. 

If the correspondence, although still approximately complete, is not 
one of simple proportionality, the graphical representation of it will be, 
not a straight line, but a curve of greater or less complexityf, and the 
correlation, also complete, wiU be measured not by the correlation 



\ 

■V 
A" 



••V 



:* 



■A'ti 









«*>V;j^ 



Kg. 16 

coefficient r, but by the correlation ratio rj, which in this case will be 
unity. Example: the variation of the volume of a certain quantity of 
gas with the pressure to which it is subjected, the temperature remaining 
constant. A number of pairs of values P^, V^; Pg, Fg; P3, F3, etc. is 
obtained, and when plotted they are found to give a "scatter diagram" 
of the approximate form of Fig. 15. 

In this figure, the dots represent the individual pairs of observations 
P, F. They cluster very closely about the hyperbola, PV = k, repre- 
sented by the broken curve. The curve is assumed to represent the 
"real" or "true" relation of the two "variates" (as we call such 
quantities as P, F), and the slight deviations of the observed values 



* Hence the correlation is said to be "linear." 
■f Conelation said to be "non-linear" or "skew." 
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from this curve are explained as due to errors of observation and to 
other factors irrelevant to the relation under invesbigation. However 
this may be, the interesting point about the figure so far as our present 
purpose of explaining correlation is concerned is that any definite 
observed P-value is "correlated" with a plurality or "array" of observed 
F- values, and that, similarly, any definite observed F-value is correlated 
with a plurality of P-values. These arrays of observed values cluster 
extremely closely about their means (situated on the curve), i.e. their 
"scatter" or "variability," as measured by their standard deviations (a), 
is extremely small. 

The modern theory of correlation is directed towards the manipula- 
tion of observations made upon phenomena of a much greater degree 
of variability than that found in the case of isolated physical phenomena. 
The increased variability is no doubt due, in the main, to the complexity 
of factors involved. The elementary factors do not admit of isolation, 
and with reference to the concomitance of variation of the two series 
of phenomena under consideration they, as it were, pull in different 
directions. The correlation coefficient and correlation ratio measure, in 
these cases, the average extent of the concomitance. As will be explained 
more fuUy in the next chapter, r can only be taken as a measure of corre- 
lation when the average relation between the two variates is linear, and 
in this case its value is identical with that of tj. When the relation is 
non-linear r is practically meaningless, but r) still measures the relation 
accurately. 

The general problem will become clearer by reference to the ac- 
companying figure. 

Let us assume that we have a group of 200 school-children and have 
measured each of them for mechanical memory (a;) and for general 
intelligence {y). Each of the dots in the figure represents a child. Then 
if we determine the mean y-values corresponding to each successive 
"group" of x-values, e.g. Xg to a^, by assuming the observations con- 
centrated on the mid-ordinate PM*, the line AB^ drawn through these 
mean y-values (marked by crosses in heavy type) represents the law of 
change of mean y-value with increase of x and gives the "most probable" 

* The true centroid ordinate is slightly nearer the denser part of the scatter diagram, 
here slightly towards the right of PM. The correction is made later by means of Sheppard's 
formulae (see above, p. 84). 

f The means have been placed on the straight line for the sake of convenience of 
exposition. Actually, they will occur irregularly on either side of it, and AB will be the 
"best fitting" straight line, determiued by an application of the Method of Least Squares. 
See next chapter. 

7—2 
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value of y for any particular value of x. If the line is straight or ap- 
proximately straight the "regression" is said to be linear, and the 
equation to the line is 

where x, y are the mean values of all the a;'s and y's respectively {not 
the means of the arrays just mentioned), a^, a^ are the standard 
deviations of the ai's and «/'s respectively, and r is the coefficient of 
correlation. 




-2 — "3 
— *- mech. memory 

Pig. 16 

Thus if 

20 is the mean value of the mechanical memory of the group, 
30 „ „ „ „ general intelligence of the group, 

4 „ „ standard deviation for mechanical memory (a^), 
7 „ „ „ „ „ general intelligence {a^, 

and, finally, -6 is the value of r; then the most probable measure of the 
general intelligence of a child whose mechanical memory is represented 
by, say, the value 14, is given by the equation 

y - 30 = -6 X I (14 - 20), 
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whence y = 23'7. This value is the average of an array of possible 
values, whose standard deviation 

= a2V(l-'-^) 

= 5-6. 

It will be proved in the next chapter that 

where x and y are deviations from the mean (not absolute values aa 
assumed above), and S ( ) indicates summation, i.e. 

S (xy) = Xjyj^ + x^y^ + x^y^ + ... + Xj^yj^, 

where N is the total number of cases (children measured). 

It is important to note that by starting from y instead of from x, 
and determining the means of the y-arrays (such as the array within 
the limits y^^' yzVz)' another regression line, CD, is obtained different 
from the first. Its equation is 

x-x = r^{y-y), 

CT2 

and it represents the law of change of mean a;-value with increase of y. 
It gives the "most probable" value of x for any particular value of y. 

If the series of means do not lie on a straight line (approx.) but on 
a curve of greater or less complexity, the above calculation is meaning- 
less. In such a case, called a case of skew correlation and non-linear 
regression, the only measure of the correlation of the two variates is that 
given by rj, the correlation ratio, ij is the ratio of the standard deviation 
of the means of the arrays (2) to the total standard deviation (of either 
the oj's or the y's). Thus there are two values of 17, one for the a;'s, and 
another for the j/'s. They approximate closely to one another, as a rule, 
so that only one need be calculated. 

2u-t 2jn 

7] = ^^, or — . 
ffi CT2 

When the regression is linear, ij = r; otherwise tj > r. r ranges 

between the values ± 1, ij between and 1. tj is always positive. 

It will now have become clear thab the correlation ratio, rj (always) 

* Krst suggested by Bravais; shown to be the best measure by Professor Karl 
Pearson, who gave it the name of the "product-moment" formula. A. Bravais, "Analyse 
mathematique sur les probabilites des erreurs de situation d'un point," Acad, des Sciences, 
M4moires prisenUs par divers savants, IP Serie, is. 1846, p. 255. Karl Pearson, F.R.S., 
"Regression, Heredity and Panmixia," Phil. Trans. Boy. Soc, 1896, CLsxxvu. A, 
pp. 253 ff. 
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and the correlation coefficient, r (when regression is linear) are measures 
of the tendency towards concoinitant variation exhibited by two series 
of phenomena, and hence throw some light upon the causal relations 
of these phenomena. Exactly what kind of causal relation we are justijSed 
in inferring from them will become clearer in the course of the next few 
chapters. 

We may illustrate the significance of the idea of correlation in a 
slightly different (and more elementary) way. Let us suppose that the 
200 children have been arranged in order of merit, as regards mechanical 
memory, on the one hand, and as regards general intelligence on the 
other. If now it were found that each child's order was the same in 
both, i.e. that the child first in mechanical memory was first in general 
intelligence, the child second in mechanical memory was second in general 
intelligence, and so on, correspondence between the two series would be 
complete and r would equal + 1. Or if, on a second supposition, the child 
first in the one was last in the other, the child second in the one was 
next to last in the other, and so on, the correspondence between the two 
series would again be complete, but inverse, and r would be — 1. Finally, 
if there is no correspondence whatever between the two series, r will 
be zero. A value of r between and + 1 will express a tendency, greater 
or less according to r's size, for children above the average or mean 
position in the one ability to be above the mean position in the other, 
and for children below the 'mean position in the one to be below the 
mean position in the other. A value of r between and — 1 will express 
a tendency, greater or less according as r is numerically greater or less, 
for the children above the mean position in the one ability to be below 
the mean position in the other, and conversely. Now if order or "rank" 
be taken as an inverse measure of ability, the value of 

becomes ^ &S (<P) 



N (m - 1) ' 

where d is the difference between the rank of an individual in the one 
series and his rank in the other. This form gives us a general impression 
of its appropriateness for the purpose in view, since the greater the 
disparity between the two series of ranks the greater is S (d^) and hence 
the smaller is r. If there is no relation at all between the two series, 
S (d^) acquires the value it would have according to pure chance, and 
this can be shown to be N {N^ — l)/6, which makes the whole expression 
zero, as it should do. 
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The one objection to the formula is that it assumes the difference 
between any two neighbouring ranks, to be equal at all parts of the 
scale. This is obviously a false assumption; the distance of individual 
from individual at the two extreme ends of the scale must be con- 
siderably greater than that between individuals near the middle. A 
correction for this, based on bhe assumption that the form of distribution 
of the abilities in each of the cases is Normal, has been calculated by 
Professor Pearson. It is 

r=2singp), 
6S (d^) 



where p = 1 — 



N (N^ - 1) ■ 



At the end of this chapter is given a table whereby /o- values may at 
once be converted into corresponding r-values, according to the above 
equation. 

Finally, there is the question of the "probable error" (p.e.). Like 
every other constant calculated from a limited sample of variable 
material, the coefficient of correlation varies in value from sample to 
sample, and a measure is needed of the limits within which it may be 
expected with a fair degree of probability to lie. This measure is given 
by the probable error. In the case of r determined by the product- 
moment formula, when N is sufficiently large, 

•67449 ,. „, 

which means that it is an even chance that the true value of r lies 
between the limits 

•67449 (1 - r2) 



r ± 



V^ 



The chances are 16 to 1 against the value falling outside the limits 
r ±3 P.E. 

For r determined by the rank formida, the probable error is slightly 
larger, being -7063 (1 - r^)/VN. 

If N, the number of cases, be small (say, less than 30), the probable 
error is larger. Its exact size under such conditions is not known. 

The following is an example of the way in which a correlation 
coefficient may be obtained by means of ranks. The subjects were boys 
in the Fourth Form of a Public School, and the correlation to be obtained 
is that between ability in Classics and ability in Drawing. 
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Form Order 





Classics 


Drawing 






d' 


E. C. 0. 


1 


9 




(1~9)» = 


= 64 


H. G. M. 


2 


2 









B.L. 


9 


16 






49 


F. L. S. 


7 


6 






1 


C. M. S. 


3 


IS 






144 


C. J. L. H. 


5 


4 






1 


A. L. P. 


6 


17 






121 


E. G. T. 


4 


3 






1 


F. C. F. 


8 


5 






9 


N. P R. N. 


U 


14 






9 


H. B. D. 


10 


12 






4 


S. H. T. 


14 


7 






49 


H. B. M. 


12 


1 






121 


L. H. S. 


13 


8 






25 


J. P. 0. 


15 


10 






25 


E.W. 


16 


18 






4 


C. C. M. 


17 


11 






36 


L. H. W. 


18 


13 






25 


E. M. J. 


19 


19 









N=19 










688=5 (d2) 


p=l- 




I)-'- 


6 
19 


X 688 
X 360 


= •40, 




:. r=2 




= 


•416, 






•7063 (1 - r2 

P.E. = ^== 


)^ 


= -134. 





' ' Vn 

r is here just over three times its probable error, and we might 
therefore feel inclined to conclude that it proves a real correlation 
between the two series. We must remember, however, that 19 is a very 
small number of cases, and that therefore the real probable error is 
considerably larger than that given by the formula. Hence the reality 
of the correlation is not so certain. Our caution is proved to be justified 
when we turn to the next higher form, the Remove, and find that, with 
the same number of boys, the correlation between ability for Classics and 
Drawing ability works out as — -313 (± ■14), quite a different result. lb 
might be objected that other factors than mere smallness in the number 
of cases were responsible for the difference; e.g. that the tendency to 
specialise in Classics was greater in the Remove than in the Fourth, and 
that the consequent neglect of Drawing by the abler boys lowered the 
correlation. To this it may be replied, firstly, that the drawing-master 
was the same for both forms, and was likely to get as much out of the 
boys as possible in each case, and, secondly, that the difference between 
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the two forms in respect of the degree of specialising tendency was in- 
sufficient to account for the disparity of the results. 

The correct way to compare the results mathematically is to deter- 
mine the probable error of their difference. This = the square root of 
the sum of the squares of the probable errors of each*, i.e. 



P.E.„^ = Vf.E.„^ + F.E../, 

which, in this case, = V-IS^ + -14^ 

= •19. 

The difEerence = -416 + -313 = -73, nearly four times the size of its 
probable error = -Id. 

A very important extension of the theory of correlation is the con- 
ception of "partial" correlation. If, e.g., three mental abilities are 
correlated with one another, it is of interest to know how closely any 
two of them are correlated with one another for a constant value of the 
third. Such a coefficient is written, in Yule's notation, ria.a- 

This may be illustrated from our example by taking the form order 
for English into consideration in addition to that for Classics and that 
for Drawing. The correlation between Classics and English works out 
as -78, that between Drawing and English, as -21. 

Then the correlation between Classics and Drawing for "English 
constant" is 

'I'CD — ''^OB^DE 



rCD.E '■ 

y^i- — rcE') V V-i — roE' 

.AO ^ .70 V .01 

= •42. 



Va-rcBi')V{l-rDE') 
•42 - •TS X ^21 



V(l - -782) V{1 - -21^) 

Thus in this particular case, the "partial" coefficient is practically 
identical with the "entire" coefficient. 

If therefore boys were selected, out of a population of which 
the actual form is a random sample, so as to be all equal in their 
"English" ability, the correlation between their "Drawing" ability and 
their "Classics" abUity would be unaffected. Of course such a set of 
boys, in addition to being all alike in BngHsh, would be less scattered 
in both Classics and Drawing (especially in the former) than are the 
boys of the actual form, and their average ability in these subjects 
would be higher or lower than that of the actual form according to the 
level of ability in English at which they had been selected. 

On the other hand the partial correlation of English and Drawing 

* See p. 24. 
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for "constant Classics" will be found to be — -2, so that selection for 
Classics creates a negative correlation between English and Drawing, in 
so far as we can judge from this particular case. 

The reader must be warned against the temptation to draw deduc- 
tions as to the "common factors" uniting any of these pairs of subjects. 
The fallacy underljdng such reasoning is discussed in pp. 139 — 145. 

The principle of partial correlation can be extended to include an 
indefinite number of variables, and general formulae for this purpose 
will be given in Chapter VII. 

It is obvious that when the subjects in the group examined are not 
all alike in respect of some irrelevant factor such as age, these same 
formulae can be employed to ascertain what the correlations would be 
in a group which was homogeneous with regard to the factor in question. 
Great care has to be taken in interpreting the results of such calculations 
however, as fundamental assumptions may not be satisfied. This will 
become clearer in the following chapters. 

Table for converting p into rfr = 2sin^pj*. 



p 


r 


P 


r 


P 


r 


P 


r 


■05 


•052 


■30 


•313 


•55 


•668 


•80 


•813 


■10 


•105 


•35 


■364 


■60 


•618 


•85 


■861 


■15 


•157 


■40 


•416 


•65 


•668 


•90 


■908 


■20 


•209 


•45 


•467 


•70 


•717 


•95 


■954 


■25 


•261 


•50 


■618 


■76 


•765 


100 


1-000 



* Quoted from K. Pearson, F.R.S., Drapers' Company Research Memoirs, Biometiic 
Series, IV. 1907, p. 18. 



CHAPTER VI 



THE MATHEMATICAL THEOEY OF CORRELATION 

Correlation coefficient r — Correlation ratio i; — Probable errors — ^The normal correla- 
tion surface and its properties — Other methods of determining correlation — ^Fourfold 
table — ^Method of contingency — Two-row table — Short methods — The method of 
ranks — Spearman's foot-rule — Correlation of sums or differences. 

In the present chapter an attempt will be made to summarise briefly 
the principal methods in use for obtaining a measure of the correlation, 
or tendency towards concomitant variation, of two or more variates. 

Let the coordinates of the dots in the accompanying diagram — 
commonly known as a "scatter diagram" — represent the measures of 
two separate characteristics, e.g. speed of adding figures (a;) and accuracy 
of adding figures {y), in a number of individuals (iV). 



I 










O speed of adding (x) 

Kg. 17 

Let the crosses represent the mean values of y corresponding to 
values of x lying between the limits of pairs of successive units of measure- 
ment. Then the broken curve CC passing through these crosses repre- 
sents the most probable law of relationship between speed of adding 
and accuracy of adding, and is known as the regression curve. (In 
practice the crosses do not lie so accurately on the curve.) 

(1) CORRELATION COEFFICIENT (r) 

What we chiefly want to know, however, even when the regression, 

as here, is not linear, is (1) whether large x is on the whole associated 

with large y, etc., (2) how to find roughly the mean y associated with 

given X. To do this, we find the "best fitting" straight line, LL', to the 
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swarm of dots in the figure, using, merely from motives of convenience, 
the Mebhod of Least Squares*. 
Let the equation of LL' be 

r = h^X + c. 
Then applying Least Squares will make S {y — Y)^ a minimum, where 
y is the ordinate of any dot, and Y the ordinate of the line at the same 
abscissa, which is both X andx. There will be as many equations 

{y-Y)=-y- (M + c) = v 
as there are dots, and the v'a correspond to the "residuals" of the 
Method of Least Squares, though here they are real deviations and not 
errors. The Normal Equations for 631 ^^^ c, formed according to the 
rule on p. 45, give at once 

S (xy) - 621 S (a;2) - cS (x) = 0, 
S{y) -h^Six) -cS{l) = 0. 
If there are N points, S (1) = N; and if the x's and y's are measured 
from the mean of the whole, S{x) = S (y) = 0. The equations then become 

S{xy)^b^iS{x^), 
Nc = 0. 
Thus c is zero, that is the line LL' goes through the mean. And 621) 
which is the tangent of the slope of LL', is 

S(xy)_S{xy) 
^^ ~ S (a;2) N^i^ ■ 

The line LL' is known as the regression line, and 621 as the coefficient 
of regression of y on x. If we define r as 

, S{xy) 









Na^a^' 


then 




hi 


fa 


and the 


1 equation to LL' is 










Y = 


r^^X, 


xandy 


being measured from theii 


; mean values, 


An i 


analogous equation 










Z = 


-r^Y 



gives the regression of x on y. There are thus two regression lines. If 

* G. Udny Yule, "On the Significance of Bravais' Formulae for Skew Correlation," 
Proc. Boy. Soc. 1896, lx. pp. 477 — 489. 
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X and y, in addition to being measured from their means, are also 
measured in terms of their standard deviations as unity, the regression 
equations become 

Z = rr and Y = tX, 

and r is then itseK the coefficient of regression oiy onx and of x on y, 
the two regressions being equal. 

Since y — Y measures the distance in the y direction of any point 
from the regression line, the quantity 

S(y- Yf 
N 
gives the mean square deviation, in the y direction, of all the points 
from the regression line, 

S(y-b,,xr S(y^) ^. S(xy) S{x^) 

= o^^ (1 - r2). 
Hence the standard error or deviation made in estimating, by means 
of the regression equation, the value of y most probably associated with 
any particular x, is, on the average, given by 

a^ X V(l - r% 
and if the distribution is Normal it has this value not only on the average 
but for each array. 

r is known as the coefficient of correlation, and evidently must lie 
between the values + 1 and — 1. If the regression line coincides with 
the regression curve, within the limits of errors of random sampling, — 
in other words, if the regression is linear — r is a measure of the degree 
of dependence between x and y. When r = ± 1, the points close up upon 
the line and the "scatter diagram" contracts to become the line itseK. 

The formula r = |^ 

iVcTiaa 

is implied in Bravais' work of 1846, and was shown by Professor Karl 

Pearson in 1896 to be the best measure of r. Hence it is known as the 

Bravais-Pearson Product-Moment Formula. It may be written 

S{xy) 

''-VS(x^)VS(y^)' 
the denominator being the geometrical mean of the two second moments, 
and the numerator the ^oc^Mci-moment, of x and y. 



no 
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If X and y are not measTired from their means, but from some con- 
venient point distant rfj from the x mean and d^ from the y mean, the 
arithmetic is very considerably lightened, and the formula becomes, as 
may be tested by simple algebra, 

„_ S (xy) - Ndid^ 

V{S {x^) - Nd^^} V{S iy^) - Nd^^} ■ 

The following example is intended to show one form of calculation 
based on this formula. Being only a model for this purpose, it is kept 
short so that the arithmetic can be easily followed. But it must be made 
quite clear that really to calculate correlations with only ten cases is absurd, 
for the probable errors are enormous and moreover are unknown (see 
p. 114). A calculation with larger numbers, and made by a slightly 
different process, is given on p. 115 et seq. 

In the following table A and B are the percentage errors made by 
certain cadets in a test in judging distance, in the years 1915 and 1916 
respectively. 



Name 


A 


B 


x={A-16) 


y = (B-l3) 


a;* 


y* 


xy 


Cadet A 


15 


15 


-1 


2 


I 


4 


- 2 


.. B 


19 


10 


3 


-3 


9 


9 


- 9 




. c 


11 


14 


-5 


1 


25 


1 


- 5 




. D 


17 


16 


1 


3 


1 


9 


3 




. E 


8 


18 


-8 


5 


64 


25 


-40 




, P 


14 


12 


-2 


-1 


4 


1 


2 




. G 


24 


14 


8 


1 


64 


1 


8 




. H 


28 


11 


12 


-2 


144 


4 


-24 




, K 


7 


8 


-9 


-5 


81 


25 


45 




, L 


14 


14 


-2 


1 


4 


1 


- 2 


Means 15-7 


13-2 




Sums 


397 


80 


-24 


Froyiaional 16 


13 




Subtract 


0-9 


0-4 


- 0-6 (i.e. Ndd) 


dx=-< 


i d, = -2 






3961 


79-6 


-23-4 















23-4 



-v/(396-l X 79-6) 



= -•13. 



(2) CORRELATION RATIO (i;)* 

It is clear that if the regression is not linear r ceases to be a satis- 
factory measure of the relation between the two characters under con- 
sideration. In an extreme case, such as that shown in the accompanying 
diagram, r may be zero while there is yet a very close relation between 
the two characters. 

Clearly, if the individual observations, i.e. the dots in the figure, are 

* See Drapers' Company Besearch Memoirs, Biometrio Series, n. p. 9 et seq. Karl 
Pearson, "On the Theory of Skew-Correlation and Non-Linear Regression." 
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all exactly situated on the regression curve, the quantity y is an exact 
mathematical function of x, and correlation is perfect, or 17 = 1, where 
7j is a new and as yet undefined measure of correlation, called the 
correlation ratio; while if the individual observations are much scattered 
right and left of anyone walking along the regression curve, the correla- 
tion is imperfect, and ■»; < 1. 

If there is no scatter at all in any array, then the correlation is 
perfect, and the greater the scatter the less the correlation, i.e. the less 
certain is any prediction of y from x. 

Professor Pearson therefore makes the correlation ratio -q depend on the 
amount of scatter in the arrays. Exactly, he makes it depend on the mean 
of the weighted squares of the standard deviations of the arrays, i.e. upon 






m 



•\^ 






Fig. 18 

The correlation ratio rises and falls as this quantity falls and rises, n^ 
is the number of cases in the array of which a„^ is the standard deviation. 
Clearly the unit in which Oa^ is measured for this purpose must 
depend on the standard deviation of the whole of the y's. If the arrays 
are less scattered than the whole, there is correlation. If there is no 
correlation, any array will have just as much scatter as the whole has. 
Pearson writes 






= 1- 



r 



Compare this equation with that found above for the case of linear 
regression, p. 109, namely 

"average" mean square deviation of an array = a^ (1 — r'^), 
which in our present notation is 

-.a ^ ^^ 

CTz 
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and from the comparison we see that Pearson has chosen i^ so that it 
becomes r^ for linear regression. The correlation ratio tj however never 
becomes negative but is in linear regression niunerically equal to r. 

The above formulae already allow of the calculation of rj, but a 
simplification is possible. By its definition, 

S' being a summation up and down an a/rray, so that 

o^^ = SS'{,y,-y,flN. 
S being a summation at right angles to the former, i.e. a summation 
of the arrays. That is to say, the mean of the weighted squares of the 
standard deviations of the arrays is the same thing as the mean square of 
the distances of the dots (measured in the y direction) from the regression 
curve. This simplifies to 

Na^^ = SS' (y/) - 2SS' (y,y,) + SS' {y^) 
= Na,^ - 2S {%S' (y,)} + S (n,yj^) 
= Na,^ - 2S {yA%) + -S {n,%^) 
= Tffa^ - S (n,y/) 
= Na^^ - KE^^ 

where Sj is the standard deviation of the means of the arrays, each array 
being weighted with the number of cases in it. Therefore 



^ 

•^2^' 





<=o■,2-s,^ 


r,^ = 






. ^2 



'n = —- 



The correlation ratio, therefore, is the ratio of two standard deviations, 

one of the means of the arrays (properly weighted), the other of the whole. 

Starting from the other variate, we arrive in a similar way at a 

second value y, 

^ — -^1 
rq = — . 

Since 17 is the ratio of two standard deviations, it must always be positive. 
Let Y be the ordinate of any point on the regression line, then the 
average of the sum of the weighted squares of the distances between 
the regression fine and the regression curve 

_ S{n,(y,-Yf) 

N 

which reduces to ctj^ (tj^ — r^). 
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Thus t) must always be numerically greater than r, except in the 
case of linear regression, when it is numerically equal to r. 

In examining the relationship between two measurable characters, 
Tj should be calculated as well as r, since it serves as a test of the hnearity 
or non-Unearity of the regression, and is also a better measure of causal 
relation than r. 

A simple criterion for linearity which is very generally apphcable is 
that ,_ 



•67449 



,|vV^^2<2-5*. 



For very exact work, more compUcated formulae need to be em- 
ployed. 

The results obtained above are all independent of the forms of dis- 
tribution of the variates. 

(3) PROBABLE ERRORS 

In determining means, standard deviations, and other frequency 
constants, the investigator is unable to work from the " total population " 
and must be content with the results obtained from " random samples " 
of greater or less size taken from this (in some cases, hypothetical) total 
population. 

When the number of cases (n) in the random sample is fairly large — 
so large that fractions containing certain higher powers of n in the 
denominator can be neglected — the probable errors are found to be as 
f oUowsf : 

p. E. of a mean = -67449 —= , 

Vn 

„ „ a = -67449 -^ , 

„ „ r= -67449-^ 



Vn 

The second and third of these values are only correct when the 
frequency-distribution is normal or approximately normal. In parti- 
cular, for large values of r the true p.e. may be considerably different 
from that given by the above formula unless the distribution is normal. 

1—7}^ 

P.E. of 11 = -67449 — y=J- , for hnear regression, and also, as a rough 
Vn 

* J. Blakeman, Biometrika, rv. pp. 349, 350. 

t See W. Gibson, "Tables for Facilitating the Compntation of Probable Errors," 
Biometrika, rv. p. 385 et seq. and Pearson's Tables. 

B. & T. 8 



114 CORRELATION [PT. ii 

measure, for cases of skew correlation. If greater exactitude is needed 
in the latter cases, more complicated formulae have to be employed*. 
Another frequency constant in common use is the coefficient of 

variatton V, which = . 

mean 

Its P.E. = 



67449f|i + 2 (j^)''}yv^t- 



As stated above, the values just given for the probable errors only 
apply in cases where n is fairly large. In cases where n is so small that 
certain higher powers of its reciprocal cannot be neglected in comparison 
with the rest of the expressions involving them, the values cannot be 
used. For such cases no theoretical formulae have hitherto been devised. 

An empirical investigation has however been madef on samples of 
4, 8, and 30 cases, taken from a "total population" of 3000 pairs of 
measurements (height and left middle finger measurements of 3000 
criminals; "real" correlation, -QG). From the results obtained it may 
be concluded that, although in the case of such small samples as 4 or 8 
the ordiijary formula for the probable error of r gives much too low a 
value, yet in the case of as many as 30, the formula applies with tolerable 
accuracy. We must, however, bear in mind that this result has only 
been proved (empirically) to hold in the single case when the actual 
correlation was -66. 

The calculation of the probable errors of means, standard devia- 
tions, coefficients of variation, and coefficients of correlation is very 
much faciUtated by the use of Pearson's Tables for Statisticians, es- 
pecially Tables V, VI, VII and VIII, calculated by members of the 
staff § of the Biometric Laboratory, University College, London. 

We give next an example of the evaluation of r and of ij between 
speed of adding single digits and accuracy in doing so, the individuals 
measured being 86 boys between the ages of 11 and 12 years from two 
L.C.C. elementary schools. The two groups could be thrown together 
for this purpose, since the means and standard deviations calculated 
from them separately were in very close agreement — well within the 
limits of the probable errors. 

* Karl Pearson, op. cit. (Biometric Series, n.) p. 19. 

t Calculated values for difierent values of n given in Gibson's Tables, see pp. xxii 
and 18 of Pearson's Tables. 

% " Student " ■■ "TheProbable Error of a Coefficient of Correlation," Biometnka,190S — 9, 
VI. p. 302. 

f Miss Winifred Gibson, Dr Raymond Pearl, T. Blakeman, Dr David Heron, Misa 
H. Gertrude Jones, H. E. Soper, 
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Frequency 


x' 


Frequency 

•X.X' 


Frequency 


Frequency 


y' 


Frequency 
■x.y' 


Frequency 

•x-y'* 


3-5 
18-5 
14-5 


-3 
-2 
-1 

1 
2 
3 
4 


10-5 

37 

14-5 


31-5 

74 
14-5 

15-5 
42 
22-5 
48 


6 

4 

3 

9 

18 

25 

21 


-5-5 
-3 

-2 

-1 



1 
2 


33 

12 

6 

9 


181-5 
36 
12 


18 


-62 


9 


15-5 


15-5 
21 
7-5 
12 


-60 


— 


10-5 
2-5 


25 
42 


25 
84 


3 


m=N 


+ 67 


347-5 


86=Ar 


+56 


248 


+ 7 






-6 





e^i = - ^ = - 0-06977, 

= 2-7955, 
ai = 1-67. 



86 
347-5 



<^a = ^ = 0-0814, 



09 = 



2 ~~86" ^ 
= 4-0341, 
.-. cTg = 2-013. 



4n, 



Frequencies 


x'y'X 


Total 
frequency / 


/xa:y 


1+5-5 -4-5 -1-5 


1 


0-5 


+ 0-5 


2-5 + 0-5+2-5 + 3-4-5-5-2 


2 


-3 


-6 


1+0-5 +0-5-1 


3 


1 


3 


2+2-5 + 0-5-2-5 


4 


2-5 


10 


0-5-1 


6-5 


-0-5 


-2-75 


1+2-1 


6 


2 


12 


0-5 


8 


0-5 


4 


1 


9 


1 


9 


3 


11 


3 


33 


-1 


22 


-1 


-22 




71-5-30-75 




^f(!cy) =40-75 



* Sheppard's correction. Remember that a^=iJ.i, and d^v-^'; see above, p. 84. 
t Sheppard's correction cannot be used here, since the units of the subgroups are 
not equal and there is not high contact at the ends of the frequencies. 
X The figures in italics in the correlation table. 
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r = 



S (xy) = S {x'y') - Ndj^i^ 
= 40-75 + 86 X -0057 
= 41-24, 
S{xy) 41-24 



P.E. = 



86 X 1-67 X 2-013 
1-/2 



0-143, 



-67449 



0-071. 



J-epeed of addition = 0-14 ± O'OT. 



ace. of addition 



Vx -y 


{yx-y)''^'^x 


-0-224 


-050176 X 3-5= -175616 


- 1-081 


1-168561 X 18-5=21-618379 


0-488 


•238144x14-5= 3-453088 


0-322 


-103684x18 = 1-866312 


0145 


•021025x15-5= -325888 


0-490 


•2401 xl0^5= 2^52105 


1-719 


2-954961 X 2^5= 7-387403 


- 1-414 


1-999396 X 3 = 5-998188 




S {nJy^-y)}'=iZ-Si592i 



43-345924 



= 0-1249, 



86 X 4-034 
.-. ij = 0-353, 
P.E. = -67449 (1 - yfWN 
= 0-064. 

•'• 'Jspeed of addition = 0'35 ± 0'06. 
aoc. of addition 

Calculating ij from the means of the ^/-arrays, we have 
g _ <S {w„ {Xy - xf} 

19-68101 



86 X 2-796 
= -0819. 
.-. ij = 0-29 ± 0-07. 

To test the value of t] obtained from the means of the x-arrays, for 
linear regression : 

VF , . ,-„ ^. -323 



•67449 



Vr,^ 



-a* 



2 X -07273 
= 2-22, i.e. <2-5. 

Hence regression may be considered to be linear. 

* See p. 113. 
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Regression coefficients: 



612 = r -i : 



'-='?- 



•118, 
•172. 



Equations to regression lines are 

a; - « = 612 (y - y), 
and y-y = hi{^ — ^)- 

:. equation to regression line AB is 

■ y - 164-2 = -172 (a; - 237-7), 
i.e. y = -172a; + 123-316 



237-7 (mean speed) 



.(i). 




T40 mJ 220" 280 800 840 380 

speed (no. of figures added in 10 minutes) 

Kg. 19 

Similarly the equation to the hne CD is, 

a; -237-7 = -118 (y- 164-2), 
i.e. X = -llSy - 218-324 (ii). 

Equation (i) gives the most probable value of y associated with a 
given value of x, with a standard error 

az^/{\-r\ i.e. 1-991. 

Similarly, mutatis mvtandis, with equation (ii). 

As a model of the proper use of the correlation coefi&cient and ratio 
we may cite a very detailed investigation into the relationship of in- 
telligence to the size and shape of the head, and to other physical and 
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mental characters, by Prof. Karl Pearson*, which appeared in Bio- 
metrika, v. 1906 — 1907. The subjects measured were 1000 Cambridge 
undergraduates and considerably more than 5000 school-children. Special 
care was taken in drawing up a quantitative scale of intelligence, ad- 
justments being made so that the results fitted a "normal" or Gaussian 
distribution. The correlations were worked out in several different ways, 
— correlation coefficient (r), correlation ratio (ij), coefficient of mean 
square contingeneyf, and the method, first suggested in this paper, of 
the analograph. Pearson describes this last method as follows : " In the 
case of intelhgence, I take a normal scale as my base Une and plot up 
the percentage of the character for each grade of intelhgence along the 
centroid vertical of the corresponding range, drawing a horizontal line 
to represent the mean percentage in the population at large. We thus 
obtain a diagram, which I will venture to call an analograph. 

" If the percentage increases or decreases continually with intelhgence 
(or with the base character, whatever it may be), I term the relationship 
homoclinal; if the percentage does not reach its maximum with the 
maximum or minimum of intelhgence, I term the diagram heteroclinal." 

(4) THE NORMAL CORRELATION SURFACE AND ITS PROPERTIES 

We have so far considered the correlation coefficient r from the point 
of view from which it was approached by Galton, who measured it by 
the incUnations ^^ and 9^ of what we have called the regression hues, 
according to the formulae 

r = -^ tan 6^= — tan 6^ = VC^an 6^ tan ^g). 

We followed this up by an appUcation of the Method of Least Squares, 
first made by Mr Udny Yule, obtaining the more advantageous formula 

known as the Bravais-Pearson Product-Moment Formula. All we have 
so far done is independent of the form of distribution of the correlated 
variates. 

From a historical point of view however this is not quite the way in 
which the subject developed. In 1846 A. Bravais pubhshed, in Vol. ix 
of the Memoires de TInstitut de France, an article entitled "Analyse 
mathematique sur les probabihtes des erreurs de situation d'un 

* Karl Pearson : " On the Relationship of Intelligence to Size and Shape of Head, and 
to other Physical and Mental Characters," Biometrika, 1906 — 1907, v. pp. 105 — 146. 
•f See below. 
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point," which was really a very complete study of normal correlation, 
although this is not perhaps obvious to anyone casually glancing through 
the volume. (He uses the word "correlation" itself, however, on p. 263.) 

Galton's work was done in ignorance of this article, and of course 
was work applied directly to certain social and anthropological problems, 
whereas Bravais' is a piece of mathematics only. Also in ignorance of 
Bravais' work, Professor F. Y. Edgeworth* took some important steps 
on the road later followed by Professor Karl Pearson, who connected 
Galton's work with Bravais', adopted the product-moment formula 
which was impUcit in the latter's equations, and showed that it is the 
best formula for the purpose, i.e. it has the least probable error. 

Mr Udny Yule still later pointed out the simple method of arriving 
at this formula which we have used in an article of the Proceedings of 
the Royal Society for 1896 — 7, the chief importance of which is that it 
points out the fact, then first adequately reahsed, that the product- 
moment formula has a definite significance even if the distribution of 
errors is not normal. 

In the present section we shall now consider more definitely normal 
correlation. 

If the X variate is normally distributed according to the law 

V{27r) <Ti 
and the y variate according to the law 

vm^/^' '^'' 

then the probability of simultaneous occurrence of a value x and a value y is 

P = „— ^e 2W= '^i'Jdxdy (3), 

provided x and y are independent or uncorrelated. If however they are 
correlated, this probabiUty is 

1 / x'^ irxy ^ g/' \ 

^ 27r<7i<72v'(l-»-2) '^'''^y ^^>' 

which reduces to the former expression when r, the coefficient of 
correlation, becomes zero. The surface 

z = P'/dxdy (5) 

* Phil. Mag. 1892 and 1893, several articleB. In a paragraph buried in one of these, 
indeed, Professor Edgeworth reached, but did not realise the importance of, the product- 
moment formula, which he there gives as the best formula {Phil. Mag. July 1893, Series 5, 
xxxvi. on p. 100). 
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is the normal correlation surface, and is a hillock shaped Uke a bell 
with an oval mouth. 

When two variates are recorded on a grid-iron table Uke that used 
in the above example of Speed of Addition and Accuracy of Addition, 
the resulting table is called a "correlation table*." As, owing to the 
experimental difficulties of the subject, psychological correlation tables 
are seldom smooth enough to illustrate vividly the points about to be 
mentioned, we quote here a correlation table from the study of heredity 
which has already been used elsewhere for a similar purposef. 

Father's Stature 





Inches 


59 


60 


61 


62 


63 


64 


65 


66 


67 


68 


69 


70 


71 


72 


73 


74 


76 




60 










2 


2 


4 
























61 










2 








4 




















62 




1 


1 




2 


4 


1 


1 


2 


2 
















63 


. 


1 


1 


9 


9 


8 


16 


20 


11 


5 


. 


1 


1 












64 


4 


. 


6 


16 


12 


17 


32 


37 


12 


5 


6 


3 


5 












65 


8 


4 


2 


8 


13 


38 


54 


43 


30 


22 


14 


10 














66 


. 


2 


4 


9 


21 


38 


40 


67 


70 


64 


21 
50 


8 


10 


4 








t 


67 




6 


8 


19 


14 


55 


79 


106 


103 


78 


55 


13 


2 


* 






1 


68 






6 


8 


30 


40 


41 


97 


126 


94 


118 


53 


34 


38 


9 






CD 


69 






4 


. 


21 


20 


51 


73 


64 


96 


116 


86 


40 


14 


9 




* 


1 


70 










4 


10 


23 


75 


47 


78 


90 


78 


58 


25 


14 


6 


4 




71 












13 


20 


35 


43 


76 


59 


83 


43 


32 


20 


4 


4 




72 












1 


12 


5 


28 


31 


43 


45 


40 


34 


11 


2 






73 














3 


3 


10 


30 


26 


24 


30 


25 


13 


2 


2 




74 










4 




6 


6 




21 


9 


10 


26 


13 


13 




8 




75 




















4 


8 




10 


3 


7 


2 






76 




















5 


1 




2 


4 


4 








77 




















5 


1 


4 






6 








78 






















4 


4 




1 


3 








79 




























1 


1 







For simplicity in printing, the numbers in the original table in Biometrika, 1902 — 3, 
□. p. 416, nave been multiplied by four: this eliminates quarters and halves which occur 
through some heights being half-way between whole inches. 

* It is very unfortunate that Mr J. C. Maxwell Gamett has recently used this term 
(already fixed in the meaning given in the text) for something quite different. Proe. Boy. 
Soc. 1920, xcvi. A, p. 100. 

t E.g.' Mr Udny Yule's text-book on the Theory of Statistics; and, independently, 
by 6. H. Thomson, "Mathematics and the Inductive Methods of Logic," Proc. Univ. 
Ihirham Phil. Soc. 1912—13, v. pp. 76—99. 
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It is interesting to look at a correlation table in greater detail. If 
we think of it as a plane horizontal surface, and erect over the centre 
of each compartment a vertical hne proportional to the number written 
in that compartment, then the tops of these lines touch the correlation 
surface. 

It is clear from the equation to the surface that the contours, or 
lines of equal 2, are ellipses, given by the equation 



2ra;t/ ^ y' 



= constant 



(6). 



In the figure the numbers 40 and over have been printed in italics so 
that this contour line can be approximately followed. It is seen to be 
roughly elUptical, the major axis of the elhpse lying obhquely. The 
major axes of all the contour ellipses of a surface showing correlation 
are inclined to the axes of coordinates; and if, as we always can, we 
choose the linear units of x and y on the diagram in such a ratio that 



/i. 





Rg. 20 



Fig. 21 



a^ = CTg) then this incUnation will be at 45°, as shown in the accompanying 
figure (Fig. 20). 

In the case of no correlation, r = 0, and if a^ = a^ the equation (6) 
given above for the contour Unes represents a circle. In this case, 
therefore, with suitable units of x and y, the contours are circles, and the 
correlation surface is a perfectly symmetrical hillock. As correlation 
increases, the contour lines become more and more drawn out along 
the 45° line (or the 135° Hne for negative correlation), as suggested 
in the figure (Fig. 21). 

In a contoured plan of a correlation surface, those lines through the 
origin are important which cross all contour lines at points where the 
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tangents to the contour lines are parallel to the axes of coordinates. 
Such are i^i^ and L^^ in Fig. 20. If we differentiate equation (6) 
with regard to x and equate to zero we shall obtain L^^. We find 

2a; Iry 



= 0, 



or simplifying x 

Similarly the equation of L^L^ is 



y 

<^2 



y _ X 

That is, these are the regression lines. 

We have said that any horizontal section of a normal correlation 
surface is an elUpse. Any vertical section, on the other hand, is a normal 
probabiUty curve. Consider first a vertical section parallel to the ai-axis. 
Write y = c in equation (4). 

After a Uttle simple algebraical arrangement this then reduces to the 
form j2 , 

V(2^)<^iV(i-'-') 

This is a normal curve, with area 



Its centre is at cc = rca^la^, y = c, i.e. on the regression Une. Its 
standard deviation is a^ \/(l — r*) and is independent of c. 

Similar statements hold for a vertical section along a line x = c', 
the constant standard deviation being a2\/(l ~ '■^)- 

A similar procedure will show, with rather more cumbrous algebra, 
that any vertical section is a probabihty curve. 

(5) OTHER METHODS OF DETERMINING CORRELATION 
1. Fourfold Table. 





^ 


Xt 




Vi 


a 


b 


a + b 


Vi 


c 


d 


c+d 




a+c 


b + d 


N 
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(a) When the divisions pass through the means of both characters, 

. TT (a — b) 
r = sin H 7 — TTT . 
2 (a + b) 

This formula (Sheppard's) is of Kttle practical use, since the mean 
values, in cases where the fourfold table is the only method which can 
be used, are generally unknown. 

(6) In cases where the dividing lines do not pass through the means 
(means not known) the equations for determining the correlation are : 




a 



Fig. 22 



and 



(ffl + c) 



talno.) VttJo 



i\r (the total 

(the cross-hatched area on the upper curve) 



(a + b)-(c + 



N 



^''VUl' 



'dy 



(the cross-hatched area on the side curve) 
from which h and k can be obtained by Sheppard's Tables ; and finally 
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ad — he r^ t^ r* 

r + ^ M + ^ Qfi - 1) (F _ 1) + L A (^2 - 3) yfc [B - 3) 



iV^HZ 



+ jj (A* - 6F + 3) (F - 6^2 + 3) 



+ 720^' 



10^2 + 15) ]c{l(i- loF + 15) 



+ J^ (h^ - 15¥ + 4:5¥ - 15) (^« - 15ifc* + 45F - 15) 



5040 
+ etc., 



where 



V27r 



1 



-i*=* 



In obtaining these equations, a normal correlation surface is assumed, 
with the equation : 

,7 1 1 (X' 2rxy y'\ 

2 = ^ e 2 1 -r" W "1", ".y 

The probable error of r obtained by the fourfold table method is 
much larger than that given by the formula of p. 113. The correct 
formula is too compUcated to insert heref . 

2. Method of ContingencyJ. 

The following is an example of a contingency table §. 

Fathers 





Merry 


Melancholy 


Alternating 


Even 


Totals 


Merry 
Melancholy 
Alternating 
Even 


122 
10 
70 
68 


8 
2 
9 
6 


81 

7 

101 

66 


67 
10 
68 
45 


278 

29 

248 

175 


Totals ... 


260 


25 


255 


190 


730 



* All this is rendered much easier by P. F. Everitt's "Tables of Tetrachoric Functions," 
Biometrika, vn. pp. 437 — 451, or Table XXIX in Pearson's Tables. 

f Unless the dichotomies are extreme, good values are obtained from the use of 
Tables XXIII and XXIV, Pearson's Tables. 

I Karl Pearson, "On the Theory of Contingency and its Relation to Association and 
Normal Correlation," Drapers' Company Research Memoirs, Biometrio Series, I. 1904; 
Dnlau and Co., London. 

§ Taken from a, paper by E. Schuster and E. M. Elderton on "The Inheritance of 
Psychical Characters (being a further Statistical Treatment of Material Collected and 
Analysed by Messrs 6. Heymans and E. Wiersma)," Biometrika, 1906 — 1907, v. pp. 
460—469, 
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The method is employed when the grouping is merely by class and 
the different classes have no known relation to one another — ^in other 
words, when the grouping is merely quaUtative. The order of the 
different quaUties can be changed without making any difference to 
this method. 

The relationship between the two variables is measured by the 
differences between the numbers actually found in the various com- 
partments of the table, and the numbers that might be expected there 
by pure chance. 

To state the rule : 

The total mean square contingency, <f>^, of the table is given by 



*'-4«. 



((—"^) 



N "^ 

where Wj, = total frequency in ^th row, 

Wj = total frequency in jth column, 

Wj,j = frequency of constituent common to pth row and gth column, 
N = total number of cases in the table. 
Then the coefficient of mean square contingency C^ is: 



«.=yr^ 



If it is assumed that a normal distribution underhes the classification, 
and if the fineness of grouping is right, then the coefficient C^ is nu- 
merically equal to the correlation coefficient r. 

In the above case, Gy = 0'16. 

The probable error of C^ is very comphcatedf . 

For Gi = 0, p.E. = -67449/-^/^^. 

Instead of calculating the mean square contingency, it is easier though 
not so accurate to calculate the mean contingency. Each quantity 

* There are CCTtain corrections to <(>', not mentioned here, wBich often make con- 
siderable difference to the result. See K. Pearson, IT.R.S., "On the Influence of Broad 
Categories on Correlation," Biometrika, 1913, ix. pp. 116 — 139. The important point is to 
have fine enough grouping, but not so fine as to leave cells with very few or no cases 
in them. 

■f J. Blakeman and Karl Pearson, "On the Probable Error of Mean Square Contin- 
gency," Biometrika, 1906, v. pp. 191 — 197. A. W. Young and Karl Pearson, "On the 
Probable Error of a Coefficient of Contingency without Approximations," Biometrika, 
1916, XL pp. 215—230. 
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is called a subcontingency, and it will be observed that in tlie formula 
for ^2 these were squared. Instead of this, let us, without squaring 
them, add the positive subcontingencies only (for of course the sum of 
the whole is zero), and write 



N>l' = S^^n^-'^Y 



Erom ip, by using Table XXXIV of Pearson's Tables, a value of 
the second coefficient of contingency Cg is read o£E, which, under condi- 
tions similar to those outlined above, also is eqmvalent to r. 

The contingency method, it must again be emphasised, gives these 
two measures of the connection or association of the quaUties considered 
even without any assumption that a continuous variation underhes 
the discrete classification. If however such is assumed, then the approach 
to equahty of C^ and C^ will be a good measure of the normality of the 
distribution and the suitability as to smallness of our elements of 
grouping. With very fine grouping we get into difficulties owing to having 
to record by units only. 16 to 25 subgroups is a good range. 

It is interesting to find that areas of positive contingency are sepa- 
rated from areas of negative contingency on a normal surface by a 
hyperbola having a simple relationship with the contour elhpses. 



3. Two-Eow Table*. 

This method gives a unique value of r in the case of two variates 
one of which is both quantitative and continuous (e.g. intelhgence), 
while the other, though quantitative, admits of only two subdivisions 
(e.g. into good and bad visuahsers), or, in more technical language, is 
" alternative." 

-*• X (intelligenoe) 





1 
















Good 
visualisers 




















Bad 

visualisers 






































N 



* Karl Pearson, F.R.S., Biometrika, 1909, vn. p. 97. 
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The assumptions made are two in number: 

(1) that the regression is linea/r: 

(2) that the distribution of the alternative variate is approxi- 
mately normal or Gaussian. 

The accompanying diagram (Pig. 23) will make the method clearer. 
LL' is the regression Une of x on y, its equation being 

x-x = r'^{y-y). 



<^2 
X 



r = : 



y-y 



i 
y 

Good 
visualisers . 



Bad 

visualiseTs 



— T 



-y^ 



^^■ 



\ 



V 



R 

i 



St 

M 



iM 



Fig. 23 
Putting X, the mean of all the as's, = p, x = p', the mean value of 
the array of x's corresponding to the smaller group of y's (good visual- 
isers), and y — y = q, the distance of the centroid of the area RST from 
the mean MM of the Gaussian curve, we have the formula : 

p-v' 



The numerator of this is known, and the denominator 

1 x" 

l-QO 

e ^ "'* xdx 



Vo 



'Xt 



02 <T2 X area of part RST 
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where w^ = no. of cases in upper group (area RST), 

and N = total no. of cases. 

In this expression ^ = |(1 + a) in Sheppard's Tables* from which the 

corresponding value of — can be read; and the value of z corresponding 

to this latter result gives i /j.^\ a 

1 "2 UJ . 



V277^ 



P-P 



„ /the value of z in Sheppard's Tables corresponding\ " 
fif \ to the I (1 — a) of ^ J 

The above method might advantageously be used in determining 
the relationship of two characters, such as those suggested (inteUigence 
and visual imagery), which are both quantitative, but of which one 
only admits of reliable division into two groups. It is the smaller of 
these groups (the "tail" of the distribution) which gives the n of the 
formula. In the suggested case, it might be represented by the number 
of individuals, out of the entire group N measured, who successfully 
pass certain somewhat difficult tests of visual imagery. 

In cases where the x-variate (continuous and quantitative in our 
example above) can only be divided into classes, showing no definite 
order or quantitative relations to one another, the y-vaiinte being again 
quantitative and assumed to follow a normal distribution, but alter- 
native, a modification of the above method gives ij. 

4. Short METHODsf . 

(i) It can easily be shown that 



and therefore that 



2e7a.(73, 



(ii) If the distributions are both normal, and if both variates have 
the same meaning, and the same standard deviation a, then 

* Biometrika, n. p. 182; Table II of Pearson's Tables. 

t Karl Pearson, " On Further Methods of Determimng Conelation," Drapers' Company 
Besearch Memoirs, Biometrie Series, vr. 1907. 

B. & T. 9 
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where S is the sum of the positive difEerences only. This method might 
sometimes be conveniently used in determining the "individual" corre- 
lation between performances of the same individuals in the same mental 
tests on different occasions. 

5. The Method of Banes. 

Some years ago the valuable suggestion was made by Professor 
C. Spearman* that measurements of psychical performance may con- 
veniently — nay, preferably — be replaced by the numbers representing the 
rank or order of merit of the individuals in the group. On this basis 

S (xy) 
the ordinary product-moment formula for r, ^ , can be easily shown 

to reduce to the form oq / vo 

P = ^~ N{m-i) ("'' 

where v^ and v^ are the ranks of an individual in the two series. 

Professor Spearman also suggested a still simpler formula, which he 
calls a "foot-rule" formula. It is 

« = 1 - ^ (AT^ - 1) (^)' 

where S(g) denotes the sum of the "gains" in rank (sum of positive 
differences) of the second series on the first, and then empirically, by 
noting the distribution of a large number of chance values of S [g), 



. = singi2) 



(y). 



This method of using ranks, and the formulae suggested therefor, 
were vigorously criticised by Karl Pearson in the paper quoted on the 
preceding pagef. 

Some form of frequency distribution must be assumed, and the 
"foot-rule" method assumes that form to be a rectangle. On the assump- 
tion of normal distribution, Professor Pearson shows that 

r=2singp) (S), 

where p has the value given above; 

* C. Spearman: "Measurement of Association between Two Things,'' Am. Joum. 
Psychol. 1904, xv; "'Foot-rule' for Measuring Correlation," Brit. Joum. of Psychology, n. 
Pt. I, July 1906. Were it possible to keep R in its place merely as a "foot-rule" whose 
"chief mission is to gain quickly an approximate valuation of r," it would not be harmful. 
But the ease of its calculation leads to its use too frequently "not merely for assay purposes 
as originally contemplated, but even sometimes for research" (see C. Spearman, Brit. 
Jmirn. of Psycltol. 1910, m. p. 286). 

t Professor Spearman endeavours to meet some of these criticisms in Brit. Joum. 
of Psychol. 1910, in. p. 271 sqq. 
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In terms of the sum oi positive differences of ranks ("gains" in rank) 
the formula is 

r=2cos27r^'^^^) ^ 



= 2cos|(l-i?)-l (c). 

This agrees very closely with Spearman's formula (y), and, having 

a definite theoretical basis, should now take its place. 

Professor Pearson has proved that the probable error of p for zero 

•67449 

correlation = . . He is thus able to show that, for zero correlation, 

Vn—1 

p.B. of r deduced from p = , 

^ Vn-l 

A T, -7738 
and p.E. „ r „ „ R= 

Vn — 1 
While this book is going through the press. Professor Spearman has 
very kindly sent on to us the proof sheets of portions of a work on 
correlation by Professor Wirth of Leipzig, in which considerable space 
is devoted to the method of ranks, of which the author highly approves 
and which he ingeniously develops in several ways. Although we cannot 
completely share this approval, we can appreciate the thorough exposi- 
tion given and especially the clear distinction drawn between (1) the 
stage of complete measurement, (2) the stage of ranks in which classes 
are arranged in order, and (3) the stage of contingency, in which classes 
of X can be assigned to classes of y, without however being in any order 
of magnitude. 

This shows clearly the position which rank methods take up. The 
controversy concerning these methods really falls into two parts, 
(1) questions of principle and (2) questions of mathematical detail, 
often of primary importance. Leaving the latter aside however, let us 
consider Professor Spearman's claim that ranking is superior to measure- 
ment in psychology because the things measured are not " homogeneous," 
for example the skin's "spatial sense" and its sensitivity to pain*. In 
short. Professor Spearman raises the whole question of mental abilities 
being "magnitudes" or true "quantities" (see Chap. I, p. 11). 

We would reply that there is much truth in what he says. But 
(1) for practical purposes the correlations of the physical signs of mental 
states are themselves often important, whether or no they are the true 
* Brit. Journ. of Psychol. 1906, n. p. 91. 

9—2 
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correlations of the mental states as such, and (2) the physical measure- 
ment often, at least, goes further than giving the mere order of merit 
of the mental states. It frequently ranks the differences also (cf. p. 12). 
Indeed, in many cases, such as the giving of examination marks, there 
is a definite attempt to do more than say that this is the order of merit 
of the competitors — there is an attempt to say something like: — "This 
candidate is easily first, the second and third are close together, the 
fourth a long way behind," and so on. While we are prepared to agree 
that actual marks, in an examination or a test, are hardly likely to be 
true measures of the desired mental state, yet we claim that they show 
often a good deal more than mere ranks. Mental measurement need 
not remain at the second of Professor Wirth's levels, even though it 
may be long before it can reach the first. 

(6) CORRELATION OF SUMS OR DIFFERENCES* 

This article is concerned with the following problem. After calcu- 
lating the correlations between several series of values, it frequently 
happens that we want the correlations given by some of the series added 
together; and differences are not less important than sums. The corre- 
lation of the pool is Tiot the mean of the correlations. The general problem 
is as follows. 

Let the two series of values be denoted by a^, a^ ... a^,, and 
6j, 62 ... hg, each being measured from its own mean and consisting of 
N cases. Let these variates be multipHed by constants or weights. 
Required the correlation between 

A = %% -I- n^a^ + ... + Wjjflj 
and B = m^b^ + m^^ + ... + mj}^ 

Since all the a's and 6's are measured from their means, it is clear that 
A and B are also so measured. The required correlation is therefore 

S' (AB) 
*" ~ ^/S' {A^) VS' (B^) ' 
where the symbol S' indicates summation from 1 to N. We shall retain 
the symbol S, on the other hand, for summation from 1 to ^ or 1 to q. 
Consider now the correlation of any particular a with any particular 6^ 
It is given by ^' (g^) 

That is, Na^a^r^, = S' (ab). 

* 0. Spearman, Brit. Journ. of Psychol. 1913, v. p. 417. 
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Multiplying the a by its constant n, and the h by its constant m, only 
alters the standard deviations of these quantities, not their means, 
since they are already measured from means. We have therefore 

iVwjwcTaCTjraj, = S' (na.mb), 

and summing this over the -p and q measurements of a and h we get 

NS (nmaga^r^) = SS' (na.mb). 

Now the right-hand side of this equation means the sum of all pos- 
sible products of na and mb. But a httle consideration will show that 
this is exactly what the numerator of r, viz. S' (AB), means. Therefore 

S' (AB) = NS (nmj^a^rj. 

The two quantities in the denominator of r are found similarly, or by 
putting A = Bin the expression just arrived at, and we have finally 

^/{S (nncracr„r^) S (mnKTiaiTa,)} ' 

wherein it must be noted carefully that the summations in the de- 
nominator include the correlations of an a or of a 6 with itself, corre- 
lations which of course are unity. Excluding these cases from the 
summations, and recognising that each case hke a,at has a twin case aja„ 
we arrive at the following form of the same formula : 

S (nmajy^rg^) 

*■ ^{S (nW) + 25 («ma„a„r^)} v'{'S (mV) + 2-S (mma,<j^rj,„)} ' 

In the form in which this formula will perhaps most frequently be 
used, j> will equal q, and both will equal 2, while the weights n and m 
will all be unity. We then have 

~~ VK/ + Oc^ + 2(Tai(ToaVoJ VWi^ + <^»i* + 2o-6iO-s,rs^6j " 

The immediate practical uses of the general formula are perhaps not 
so important as the theoretical uses to which Professor Spearman has 
apphed it. 



CHAPTER VII 

THE INFLUENCE OF SELECTION 

Influence of mild selection on o- and r — Bigorous selection and partial correlation — 
Three correlated variables represented by dice throws — ^Multiple correlation — 
Spurious correlation — Variate difiference correlation method. 

(1) THE INFLUENCE OF MILD SELECTION 

The essential point about the whole theory of correlation is that it tells 
us how a group of individuals selected from the general population 
according to some characteristic (say as being within certain limits of 
height, or possessing some mental abiUty or manual dexterity in a high 
degree) will also difEer from the general population in other charac- 
teristics. 

The ordinary correlation coefficient already tells us much in this 
respect. For example, if the correlation between two abilities, say 
(1) the abihty, whatever it may be, which is measured by Dr McDougall's 
Dotting Machine and (2) the ability to memorise Nonsense Syllables 
according to certain experimental regulations, be known to be -4 for 
the whole population, this means that if a group be selected with 
"Dotting" abihty equal to x (measured from the general mean in 
a units) then this group will most probably have an average "Nonsense 
Syllable" abiUty equal to -ix (measured in a similar way). 

Clearly in practice we do not usually know the means and the 
correlation for the whole population but only for samples. We take 
large samples and endeavour to ensure that they are random and not 
selected samples from the population we wish to investigate. 

The selection contemplated in the above example is very rigorous: 
all the individuals are presumed aUke in regard to "Dotting" abiUty. 
In practice such a rigorous selection never takes place. The boys in 
a school form, for instance, are more alike in say abihty in Latin than 
the general population, yet not absolutely ahke. The "scatter" of this 
variate (Latin) has been reduced, yet not to zero. 

Just as selecting a group of individuals for one variate will alter 
the average value of other variates, so it will alter the scatter of these 
other variates, and their intercorrelations. It is this phenomenon which 
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in an extreme form gives us what we already know as "partial corre- 
lation" (seep. 105). 

In fact, selecting a group of individuals within certain limits of a 
quahty A impUes an indirect and less rigorous but frequently very 
important selection of the other qualities B, C, ... of these individuals 
and of their intercorrelations. Consider the simplest case of three organs, 
A being directly, B and C only indirectly selected. Let subscripts 1, 2 
and 3 refer to ^, 5 and C respectively, and let the standard deviations 
and correlations in the general population be a^, a^, a^, r^j, r^s and r^^. 
In the selected group a^ is reduced by the selection to Sj, and a^ and a^ 
are indirectly altered to Sg and Sg, r^g, t^, fzi to ri2, Tag, Xsi- Then the 
following formula;e enable these quantities to be calculated*. 

Write «i/<Ti = cos X\- 

Then Sg/o-a = sin a^gj ^^*1 ^zhz = ^i^i ®i3» 

where cos a^^ = ''w ^^^ Xu ^°<i ^os a^ = rjg sin Xv 

Further r^g = cot xi cot a^^, 

ri3 = cot xi cot Oi3, 

1'2Z ~ cos aj2 COS Oi3 
Too ==^ T : . 

" Sin ai2 sin a^ 

For example let us suppose for the moment that the correlations 
between (1) Classics, (2) Drawing, and (3) Enghsh have, in the general 
population of Enghsh Fourth Form boys, the values found on pp. 104 
and 105, viz. 

/i2 = -42, 

r^ = -21, 

rai = -78, 
and let us further suppose that, say on a standardised percentage system 
of marking, the standard deviations of the marks in these subjects are 

a-y = 16, 
0^2 = 13, 
a, = 14. 

Now suppose a mild selection of Fourth Form boys to be made on 
the basis of their abihty in Classics, and in the selected group let us 
suppose that the standard deviation in marks in Classics is reduced to 

Si = 12. 

* See Karl Pearson, F.R.S., "On the Influence of Natural Selection on the Variability 
and Correlation of Organs," Phil. Trans. 1902, co. A, pp. 1 — 66, where an interpretation 
in terms of spherical trigonometry is given. 
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We then have 



Further 



CORRELATION 

cos xi = Si/o'i = 12/16 = 0-75, 

X'=41°25', 
sin X = 0'66, 
cotx'= 1-14. 

cos ai2 = ''12 sin Xi = "42 x -66 = -28, 
ai2 = 73° 44', sin a-^^ = -96, 

cota^^— '29 > 
cos a^j = r^g sin xi = "78 x -66 = '51, 



*13 



= 59° 0', 



sin ai3 = -86, 
cot ai3 = •60. 



Sg = ^2 sin ai2 = 13 x -96 = 12-5, 
S3 = CT3 sin fflu = 14 X -86 = 12; 
rx2 = cotxi cot ffli2 = 1"1* ^ "29 = '33, 
Ijg = cotxi cot a^ = 1*14 X -60 = '68, 

r23— cosaiacosfflis '21 — -28 x -51 



sin a^, sm a, 



■96 x -86 



[PT. II 



= •12. 



*12 °i" "13 

Putting these results together for comparison we obtain the following 
tables : 

Standard Deviations 



Mild selection Rigorous selection 
Before selection in CUwsics in Classics* 



Cilassica 
Drawing 



16 
13 

14 



12 

12-6 

12 





11-8 
8-7 



Classics and Drawing 
Classics and English 
Drawing and English 



Correlations 

Mild selection Rigorous selection 
Before selection in Classics in Classics* 



•42 
•78 
•21 



•33 
•68 
•12 



-•21 



This mild selection for Classics has therefore left the scatter of abiUty 
in Drawing almost untouched, but has made the group somewhat more 
homogeneous than it was in English. The intercorrelations are all 
slightly reduced, that between Drawing and Enghsh being now almost 
nil. 

• See later. 
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(2) RIGOROUS SELECTION AND PARTIAL CORRELATION 

If we suppose the selection in Classics to be absolutely rigorous, so 
that the resulting group is absolutely homogeneous in abihty in that 
subject, then our formulae simphfy considerably and we are left with 

si = 0, 

.-. Xi=90°, 

sinxi=l, 
cos ai2 = fig, 
cosai3 = ri3; 

S3 = <y3v'(i-»'i3^); 

ri2 is meaningless, the variate 1 hemg fixed, and similarly x^; 

'23 '~~ 12 13 

This formula was reached by Mr Udny Yule* before the more general 
formulae were known, and is called the "partial" correlation of variates 
2 and 3 for a constant value of 1, and is written r^^. Mr Yule obtained 
its value by applying to three variables the methods we have already, 
following him, employed on p. 108 for two, using the Method of Least 
Squares. Let the regression equation of y on z and x be 

y = 623 ^ + ^21^- 
This gives the normal equations for 633 and 621 

S{yz) = h^S{z') + \^S{zx), 

Siyx) = b^S(zx) + b^j^S(x% 
whence, after a little simpUfication, 



h 



23 



^23 ^21^31 ^2 
'31 '^s' 



1 — r. 



and a similar expression for 621- From the regression of z; on y and x 
we get in like fashion 

1 _ ''23 ~ ^21^31 <^3 
1 '21 "2 

and ^23.1 = either 623 or ^32 'w^^i^n erg = 0-3 = unity, so that 

„ ^23 ~ ^21^31 

'oai — 



V(l-'-2l')V(l-r-3i'^)' 

* G. Udny Yule, "On the Significance of Bravais' Formula* for Skew Correlation," 
Proc. Roy. Soc. 1896, lx. pp. 477 — 489. 
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If we apply these formulae to our three variables (1) Classics, 
(2) Drawing, and (3) English, we shall get the standard deviations and 
correlations for a rigorous selection in Classics, given in the third column 
of the above tables. We see that the group is still heterogeneous in 
Drawing, but a good deal more homogeneous in Enghsh. The correla- 
tion between Drawing and Enghsh has now actually been reversed. 
Needless to say, the actual numbers in this example are not to be taken 
as giving the facts, being only used for the sake of illustrating the method. 

It is important to reahse, and becomes clear from the above con- 
siderations, that all correlations are partial correlations, inasmuch as 
there is always a selection of the group we are working with, for age, 
or race, or social standing, or what not. Indeed even the whole hving 
population is only a group, surviving from "what might have been," 
by natural selection. This wide point of view will save us from many 
of the errors into which we are apt to fall in handhng correlation 
coefi&cients. 

It is particularly tempting to draw what are usually fallacious con- 
clusions from the comparison of "entire" and "partial" correlations 
as to the underlying factors at work causing the correlations. For 
instance, in the present case one might be tempted to conclude that the 
original positive correlation between Enghsh and Drawing was entirely 
due to factors which these share with Classics, that is, to a general 
factor, and that any direct connection of these two subjects is of an 
"interference" nature. But such conclusions as to the underlying 
mechanism have to be made, if at all, with great reserve, as will be seen 
from examining cases where we have independent and first-hand know- 
ledge of the factors at work, as we have for example in dice throwing. 
A correlation can be set up between two dice throws of w and n dice 
respectively by leaving some of the m dice lying to form part of the 
second throw. 

(3) THBEE CORRELATED VARIABLES REPRESENTED BY 
DICE THROWS* 

Let n red dice, n blue, n yellow, and n white dice be thrown, and let 
the variable x be given by the combined red and white, y by the com- 
bined yellow and white, and z by the combined blue and white scores, 
as in Pig. 24. 

That is to say, there is a general factor (the white dice), common 

* This section is an extract from an article by Godfrey H. Thomson, Brit. Journ. 
of Peychol. 1919, ix. p. 323 et aeq. 
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to all tkcee variables, which causes all the correlations between them. 
These correlations are 




Orange 



Kg. 25 
The partial correlations are, by the well-known formula, 

r^.. = ry.: = r^.. = (i - ^ *)/V(l - {\f) (1 - (If) = i 
It is not permissible, however, to reverse this statement, and to assume 
that in every case where r^y = Ty, = r^a, = ^ the correlations are formed 
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solely by the action of a general factor. In fact, identically the same 
values can be produced without any general factor at all. Let n purple, 
n green, and n orange coloured dice be thrown, and let the variable x 
consist of the purple and orange, y of the orange and green, and z of the 
green and purple scores combined, as in Fig. 25. 

Here there is no general factor whatever. The connection of x with y 
(through the orange dice) is entirely independent of the connection of x 
with z (through the purple dice). 

Yet the correlations, both partial and entire, are exactly the same as 
in the first arrangement, viz. 



'trfu '«z *z; 



_ 1 
— 2 > 



'an/.z 'VZ'X ~~ 'zx.y 3 ■ 



Clearly, therefore, if we only know of three variables x, y, and z, 
formed of dice throws, that their correlations are as above, we cannot 
say with certainty whether a general factor exists or not. Let us now 
consider a more general arrangement of dice, with numbers of different 
colours, viz. W white, R red, B blue, F yellow, P purple, G green, 
and orange, thus: 




X consisting of the scores of the W, R, P, 0; y of the W, Y, 0, G, and 
z of the W, B, G, P dice. 

In this arrangement we shall call W a gewral factor, it being common 
to all three variables; R, Y, and B specific factors, they being unique to 
X, y and z respectively; and 0, G and P group factors, since each runs 
through a group of (here two) variables. 
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The theoretical values of the correlation between any two of the 
variables, say x and y, can be found by means of the formula* 
Number of dice common to x and y 
^ Geometrical mean of total dice in x and in y ' 
For example, if x is the score of 9»i dice, and y the score of 4w dice, 
In being common, the correlation is 

V9w X 4w o 
The general arrangement of dice shown in the above figure includes the 
two special cases already considered, viz. 

(1) p = G = o = zero, R=B=Y=W = n; 
and {2) P = G=0 = n, R = B = Y =W = zeio; 

which both give r^ = r^^ = r^^ = Jf- 

Of the infinite arrangements possible with this diagram, an infinite 
number (of a lower order) can in general be constructed to produce any 
given set of positive correlations between x, y and zf . Moreover, all 
these possible ways of producing the required correlations are, in our 
ignorance, equally likely to have been those used by the person making 
the arrangement of dice, although they are, it is true, not equally 
probable as chance occurrences. From the correlations, therefore, we 
cannot in general deduce what proportion the white dice (i.e. the par- 
ticular colour representing the general factor) bears to the others, for 
this proportion can vary between wide hmits, and give exactly the same 
correlations. The most we could conceivably do would be to give the 

* This formula was proved by me iaBrit. Journ. of Psychol. 1916, vm. p. 275, in ignorance 
of any former statement of it. Professor Spearman showed (ibid. p. 282) that it is deducible 
from a formula of his oonceming correlation of sums or differences. I have since noticed 
that Professor Spearman gives the following clear expression of the formula (Am. Journ. of 
Psychol. 1904, xv. p. 75) : " The correlationis always the geometrical mean between the two 
shares." I do not however agree with the application he there proceeds to make. The 
formula can also be directly deduced from Bravais, and in several other ways. It assumes 
that the elements all have the same standard deviation, as dice have. 6.H.T. 

t If he be so minded, the reader can make thousands of other dice patterns, all giving 
the correlations r^=^yz—^zx—i- One group is given by 

R=B=7 = W = hn,, P=G = 0=sn. 
But such a high degree of symmetry is unnecessary. For example, another pattern giving 
the same correlations is 

je=91re, T=91n, B=5in, P=78m, G=78m, 0=91m, W=^8n. 
The number of white dice present ranges between the two extreme cases given in the text. 

J A convenient method of finding such arrangements or patterns to give specified 
values of the correlation coefficients is explained by J. Ridley Thompson, Brit. Journ. 
of Psychol. 1919, x. p. 98. 
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"expectation" of the proportion of white dice. The meaning of this 
would be, that if a very large number of arrangements of dice were 
examined, each giving the required set of correlation coefficients, and 
if we assume that these arrangements of dice are not formed on any plan, 
beyond that they all agree in the correlations they produce, then the 
average proportion of white dice would be that named as the "expec- 
tation " thereof. But if there is any reason to think that the large number 
of cases examined are all of much the same pattern — as there would be 
were they all natural phenomena of the same sort — then the " expecta- 
tion" of the proportion of whit^ dice becomes useless and meaningless. 
We cannot conclude anything which is of any definite value in con- 
structing the pattern, except give limits within which it must lie. 

This brings us to the problem: — ^Are there any values of r^, r^^and 
rjj, which make it certain (having regard to their probable errors) that 
at any rate som£ general factor, some number of white dice, exists? The 
answer is that this is so if the correlations are large enough. For example, 
if we take the special case of equaUty of the three coefficients, 

then up to r = J the correlations can be imitated by various numbers of 
red, blue, yellow, purple, green and orange dice without any white dice. 
But as soon as the common value r rises above ^, some white dice are 
necessary. In this case therefore the proof of the existence of (at any 
rate) some amount of general factor reduces to the examination of the 
probable error of r, to see if r is indisputably greater than ^. 

In the more general case the matter is not so simple, the three values 
of r difEering from one another. The more detailed examination of this 
case is reserved for treatment elsewhere. It will be found however that 
if the quantity 

'xy ' 'vz ~r 'zx ' ^^3^'yz'zx 

is indisputably greater than unity, then some white dice, some general 
factor, may be postulated with certainty*. It may not be out of place 
to remind ourselves again that this, though true of the arrangements of 
dice we are considering, may not be true in the same sense of other 
phenomena, e.g. biological or mental phenomena. 

The following two examples illustrate the above principles. 

* A rough guide is the avercye value of the three r's: if this is greater than § some 
general factor certainly exists. The exact condition however is that given in the text. 
It is due, in this form, to Mr J. R. Thompson, see Brit. Journ. of Psychol. 1919, ix. p. 335. 
Note that all this only applies to correlations produced by overlapping dice throws, or by 
some sufficiently similar mechanism. 
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Example A 

Three variables, composed of overlapping dice throws, give corre- 
lations as follow: 

r^^ = 0-32, r„, = 0-33, r,^ = 0-54. 
Are any dice common to the three variables? 
In this case we find 

fxv^ + fyz^ + rj + 2r^r^^r^ = 0-56, i.e. < 1. 
From this we conclude that these correlations can be imitated either 
with or without a general factor of white dice. The following arrange- 
ments of dice do actually produce these correlations. 

Case (1). R = 19w, B = lln, Y = 85w, (specific factors), 
P = G = = zero, (no group factors), 

W = 21w, (a general factor). 

Case (2). R = 15n, B = I5n, Y = I6n, (specific factors), 
P = 47w, G = 25to, = 24w, (group factors), 
W = zero, (no general factor). 

Example B 

Three variables, composed of overlapping dice throws, give corre- 
lations as follow: 

r^y = 0-72, r^, = 0-77, r,^ = 0-67. 
Are any dice common to the three variables? 

In this case we have 

'ayy I 'yz t 'zx ' ^'xy'yz'zx ^^ X'vo, I.e. > 1. 

We conclude therefore that some white dice common to the three 
variables are present, i.e. that there is a general factor. The following 
arrangements of dice do actually produce these correlations, the general 
factor being a minimum in one and a maximum in the other. 

Case (1). R = 156w, B = 104m, Y = 52n, (specific factors). 







P = 

W = 


-G = = 

-- 260w, 


■■ zero. 






(no group factors), 
(general factor). 




Case (2). 


P = 

w = 


= 5= r = 

= 90w, G = 
= 198w, 


= zero, 
= 161w, 


= 


12Sn, 


(no specific factors) 
(group factors), 
(general factor). 


We 


see then that if 


















-. 2 1 ». 2 
'xy T lyz 


+ rj 


+ 2r 


xy'yz'zx 


>1, 



the presence of some white dice is certain. If the above quantity, which 
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we shall call D, is equal to or less than unity, the presence of white dice 
is uncertain. Suppose we consider two cases in which 

r^ = 0-8, r^, = 0-4, r,^ = 0-1, i) = 0-842, 
and r^ = 0-2, /„, = 0-2, r^^ = 0-1, D = 0-094, respectively. 

Can we in these two cases say anything as to the 'probability of the 
existence of some general factor? 

The answer to this question is twofold. If we suppose that the person 
making the arrangements of dice has, among all the possible arrange- 
ments giving 

r^ = 0-8, r„, = 0-4, r,^ = 0-1, I> = 0-842, 
chosen one by chance selection, and if we suppose that the other 
arrangement giving 

r^ = 0-2, /•,, = 0-2, r,^ = 0-l, 2) = 0-094 
has similarly been chosen by chance selection, then it is much more 
probable that a general factor esdsts in the first than in the second case. 
This probability will in fact rise and fall with D though it is not measured 
hjD. 

But if the person making the arrangements of dice has any definite 
rules which he follows in making the patterns, then the above probabihty 
will have much less meaning. In the former instance it would express 
the average to be found in many cases; in the latter it will no longer 
express even such an average value, and will have no practical worth. 

Before leaving for the present the subject of dice throws two points 
may be mentioned. (1) Negative correlations may be imitated by dice 
being added to one, but subtracted from the other, variable*. (2) There 
are many ways conceivable in which correlations can be produced other 
than by common factors. Consider for example the positive correlation 
between the number of hearts in my hand and the number of spades 
in my partner's, at whist. Doubtless the correlations found in mental 
phenomena are in many instances still more subtle in origin. 

(4) MULTIPLE CORRELATION 

The full significance of correlation is only to be reahsed after a careful 
study of the general theory of correlation of numerous variables, of 
which the correlation of two variables, measured by the correlation 
coefficient r, is only a particular case. The classic memoir on the subject 
is that by Professor Karl Pearson on "Eegression, Panmixia and 

* See J. B. Thompson, "The Bdle of Interfeience Factors in Producing Correlation," 
Brit. Joum. of Psychol. 1919, x. pp. 81 — 100. 
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Heredity," in 1896*, which is however too advanced for quotation at 
any length here. In 1907 Mr G. Udny Yule introduced a new notationf 
and made various improvements, and his formulae will now be briefly 
summarised. 

The regression equation for the two variates, x and y, both measured 
from their means, was found in a former section to be of the form 

X = hi^, 
the equation being obtained by an apphcation of the method of least 
squares. 

It is evident that the method thus applied to two variates might, 
without further assumptions, be used in the general case of n variates. 
Thus, if a!j, aig, ... a;„ denote deviations from means, the equation ex- 
pressing the regression of aj^ on ajg ... x„ could be written 

*1 ~ "12.34"-k'''2 + "13.24 — n^S + ••• + "ln.23 ■■•n-l^n- 

" In this notation, the su£&x of each regression coef&cient completely 
defines it. The first subscript gives the dependent variable, the second 
the variable of which the given regression is the coefficient, and the 
subscripts after the period show the remaining independent variables 
which enter into the equation" (Yule, op. cit. p. 182). 

In terms of the same notation, write 

''12.34 — n = ("12.34-"n-"21.34 — n)*> 

•^1.23."n = 3/1 — (O12.34 — n'''2 "I" •" ''ire .23 •■• n-l'*'n); 

Then the normal equations, from which the regressions are deter- 
mined by the method of least squares, may be written 

O [!^2-^i.23—n) = " (-^3 ••^1.23— re) = ••■ = O (^re • -^1 . 23 — n) ~ "• 

From these equations Mr Yule has deduced the following results : 



"12.34— « — '"l2.34...!J 



°'l.34...n 



<^\.Z3...n = <^1 (1 - »-12^) (1 - '•^3.2) (1 - ''^14.23) •••(!- '•V.23-»-l)> 
„ _ ^12.34—re-l ''1W.34--W-1 ^2«.34.-re-l 

'"12.34-re \ln-r'^ \r 

\^ ' lre.34...re-l^ \^ ' 2«.34...re-i; 

''12.34 — « ^^ known as a "partial" correlation coefficient, being the value 
of the correlation between 1 and 2 for constant values of 3, 4 ... w. 
Similarly, 612.34... « is a "partial" regression coefficient (cf. "partial 

* Phil. Trans, cxcvm. A, pp. 443 — 459. 

t Proc. Boy. Soc. 1907, lxxix. A, pp. 182—193. 

B. & T, 10 
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differentiation" in the Differential Calculus). Knowing the "total" 
coirelations r^g, r^, r^^, etc., we are enabled to obtain the various partial 
coefficientsby successive substitutions . Thus, inthecaseof three variables, 
1, 2, 3, we have 



r 



12.3 



^12 ^13^23 



Vl - r^,Wl - r^^' 



and two similar equations, expressing the value of the correlation be- 
tween two of the variables for a constant value of the third*. If a 
fourth variable be added, we have the further set of equations 



'12.3 ~ '14.3'24.3 
' 12 . 34 ■ 



(l-'-^4.3)*(l-'-^24.3)*' 

Perhaps a more convenient formula for obtaining the partial corre- 
lation of two variables for constant values of a third and fourth is 

^•12 (1 - >'34^) - '•m (^23 - ^24^34) 7^4^j- ^^•34) 

"^1 - '"13^- '■U^- V+ ^^IS^urai^l - 'TiZ - V - V + 2»-23»-34»'24 

When the partial correlation coefficients have been determined, the 
regressions can be found by substituting in the appropriate equations, 
and give at once the regression equations. As explained before, a 
regression equation gives the most probable value of one variable for 
given values of the remaining variables, the standard error in such a 
prediction being ffi.23...n) etc. 

An example of the method of applying the above formulae is given 
in the next few paragraphs, where the partial correlations, regressions, 
etc. in the case of four interrelated psychical capacities are worked out 
on Unes identical with those illustrated by Mr Yule in his paper. 

Example of Multiple Correlation f 
(Boys, ages 11—12; n = 66.) 

1. Crossing through two letters (e and r). 

2. Crossing through four letters (a, n, 0, s). 

3. Combination test. 

4. Mechanical memory test. 

Formula for multiple correlation : 

' ^ln.34 — B-1^2«.34— B-l 



'12.34 — n 



(1 *" ln.34..n-l) (1 ~ *" 2n.S4 — n-l) 



* For an interesting representation of correlation between three variables by a model 
showing the distribution of points in space, see G. Udny Yule, An Introduction to the 
Theory of Statistics, C. Griffin and Co., London, 1911, pp. 241—243. 

t W. Brown, Brit. Joum. of Psychol. 1910, ni. p. 317. 



CH. VII] THE INFLUENCE OF SELECTION 

For four variables this becomes: 

^12.3 ~ ^14.3^24.3 
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'12.34 - 


(1- 


-'■\4.3)*(l-'-'24.3)* 


> 






''12.3 




»"l2 — '■l3*'23 








^(T 


-'•l3^)*(l-'-23')*' 








Table I 






Correlation coefficient 


log(l-r») 






12 




0-78 


1-59284 






13 




0-45 


1-90173 






14 




0-40 


1-92428 






23 




0-48 


1-88627 






24 




0-29 


1-96185 






34 




0-52 


1-86308 








Table II 




Correlation coefficient 
(zero order) 


Product term 
of numerator 


Numerator 


Correlation coefficient 
(first order) 


log(l-r«) 


12 


0-78 


0-2160 


0-5640 


12-3 


0-7199 


i-68281 


13 


0-45 


0-3744 


0-0756 


13-2 


0-1377 


1-99169 


23 


0-48 


0-3510 


0-1290 


23-1 


0-2308 


1-97623 


12 


0-78 


0-1160 


0-6640 


12-4 


0-7570 


i-63038 


14 


0-40 


0-2262 


0-1738 


14-2 


0-2902 


1-96179 


24 


0-29 


0-3120 


-0-0220 


24-1 


-0-0386 


1-99348 


13 


0-45 


0-2080 


0-2420 


13-4 


0-3091 


1-95639 


14 


0-40 


0-2340 


01660 


14-3 


0-2176 


1-97893 


34 


0-52 


0-1800 


0-3400 


341 


0-4154 


1-91774 


23 


0-48 


0-1508 


0-3292 


23-4 


0-4027 


1-92316 


24 


0-29 


0-2496 


0-0404 


24-3 


0-0539 


1-99873 


34 


0-52 


0-1392 


0-3808 


34-2 


0-4536 


1-89996 


Table III 


Correlatic 
(firs 


m coefficient 
b order) 


Product term 
of numerator 


Numerator 


Correlation coefficient 
(second order) 


log(l-r'') 


12-4 


0-7570 


0-1245 


0-6325 


12-34 


0-727 


1-67345 


13-4 


0-3091 


0-3048 


00043 


13-24 


0-007 


1-99998 


23-4 


0-4027 


0-2340 


0-1687 


23-14 


0-272 


1-96662 


12-3 


0-7199 


0-0117 


0-7082 


12-34 


0-727 


— 


14-3 


0-2176 


0-0388 


0-1788 


14-23 


0-258 


1-97009 


24-3 


0-0639 


0-1567 


-01028 


24-13 


-0-152 


1-98985 


13-2 


0-1377 


0-1316 


0-0061 


13-24 


0-007 


— 


14-2 


0-2902 


0-0625 


0-2277 


14-23 


0-258 


— 


34-2 


0-4536 


0-0400 


0-4136 


34-12 


0-436 


1-90843 


231 


0-2308 


-0-0160 


0-2468 


2314 


0-272 


— 


241 


-0-0386 


0-0959 


-01345 


24-13 


-0-152 


— 


341 


0-4154 


-0-0089 


0-4243 


34-12 


0-436 


— 



10—2 
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The regression equation between changes in intelligence (as measured 
by the combination test) and changes in the other three variables is 

'"'3 — °31.24 ^1 + "32.14 ^Z + "34.12 *4- 

Calculation of regression coefficients. 

ai = 68-19, ag = 58-43, a^ = 16-34, a^ = 9-70, 
o^i.2S...n = cTi^ (1 - J-ia') (1 - '•'1S.2) (1 - '''14.23) - (1 - »''l«.23...«-l)- 

•■. f^3.24 = <^3 (1 - '■32')* (1 - '■'34.2)* 

= 12-77. 
Similarly, 

ffi. 24= 40-83, a3.i4= 13-27, a^. 14= 36-27, (T3.i2= 14-23, (74.12 = 8-82. 

Again, Oj2.34...n = ''12.34 — 71 



■^2.34-" 

_ '3. 

"31.24 — ''^31.24; 



6«,.,. = r,,.^^^-^=-002. 



"■1.24 

Similarly, 632.14= '099, ^3^.12= ''^^^^ 

Hence regression equation is 

Xj = -002x1 + '099x2 + "703x4 . 

The standard error (CT3.124) made in estimating x^ from aji, x^ and x^ 
by this equation 

= C73 (1 - r,,^)i (1 - r%2.i)* (1 - '■'34.12)* 
= 12-78. 

(5) SPURIOUS CORRELATION 
Correlation is said to be spurious when it is due to extraneous con- 
ditions and does not arise directly out of the functions under con- 
sideration. The term is one of relative and not of absolute significance, 
but its appropriateness will become apparent after a consideration of 
the following two examples : 

1. Heterogeneity of material. 

Let us suppose that two distinct groups of children have been 
measured for two characters A and B, and that the mean abihties, in 
both A and B, are higher in the one group than in the other. Then, 
even if there is no correlation between the two characters, as estimated 
from each group separately, a positive correlation will be obtained from 
the two groups taken together. On the other hand, if the mean abihty 
in A is higher, and the mean abihty in B lower, in the one group than 
in the other, a negative correlation will be obtained by taking the two 
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groups together. The correlation in each case will be due simply to the 
heterogeneity of the material employed. The difference in the mean 
values for the two groups must of course have some cause, such as a 
difference of nationahty, sex, or even locaHty (within any one town or 
district) from which the children are drawn; or, again, such a cause as 
the difference of disciphne to which the two groups have been subjected 
in the past. These are extraneous conditions and, if measurable, can 
be allowed for by employing the method of partial correlation. As a 
rule, however, they are not easy to determine quantitatively; hence 
their dangerous character. 

2. Index Correlation*. 

This is a form of spurious correlation which arises from the use of 
ratios for measurements. Thus if 

and Xi, ajg, x^ are uncorrelated with one another, it can be shown that 
the correlation between 2j and Zg, is 



m-m'M-c 



«3/ 



a quantity which may be as large as '5. 

As an illustration from psychology, we may mention the correlation 
of errors of observation, either of different individuals or of the same 
individual at different times. Even if the absolute errors are entirely 
uncorrelated, the percentage errors (errors expressed as percentages or 
other fractions of the observed quantity) will often show a large amount 
of correlation. In such a case as this, the important question arises 
whether we are, from other considerations, justified in using ratios (or 
indices) rather than absolute measures. If we are, the index correlation 
is not to be regarded as spuriousf. 

* Karl Pearson, "On a form of Spurious Correlation which may arise when Indices 
are used in the Measurement of Organs," Proc. Roy. Soc. 1897, LX. p. 489. In a note 
following Pearson's paper Sir Francis Galton iUuatrates the occurrence of index correlation 
by a simple and illuminating example. 

f Spurious correlation of this (as of any) kind can be eliminated by partial corre- 
lation. 
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(6) VARIATE DIFFERENCE CORRELATION, OR THE ELIMINATION OP 
< SPURIOUS CORRELATION DUE TO POSITION IN SPACE AND TIME* 
This is a method for determining the correlation of variations from 
the "instantaneous mean," by correlating corresponding differences 
between successive values. If two variates x and y are such that 

x = <^(t) + X, 

where X and Y are the parts of x and y independent of the time t, then 
it can be shown that on certain not unreasonable assumptions, 

if m is large enough, where ^D^ is the mth difference between successive 
values of x, and similarly ^Dy. For example: 

« A 2^X »D^ A 

47 

6 
53 -4 

2 4 

55 -5 

2 -1 

57 -1 

1 
58 

Clearly the number of corresponding cases to be correlated is reduced 
each time. The method is only valid, inter alia, when there are so many 
cases in the x column that this reduction is immaterial. The following 
numerical example will it is hoped serve to make clear any obscure 
points in the above necessarily short account. The correlations between 
differences are worked out until r remains steady for several successive 
differences. 

Numerical illustration of the Variate Difference Correlation Method^ 
The numbers headed Savings and Tobacco respectively are "indices" 
from Professor Georgio Mortara's article in the Giornale degli Economiste 
e Rivista di Statistica, February 1914. As they stand, they include a 
continuous secular increase both of savings and of consumption of 
tobacco which has occurred in Italy during the period in question, and 
their correlation is -984. But first differences only correlate to an extent 
•766 as is shown in detail in the following working: 

* Miss P. E. Cave, Proc. Boy. Soc. 1904, Lxxiv. p. 407; R. H. Hooker, Joum. Soy. 
Stat. Soc. 1905, Lxvm. p. 396; "Student," Biometrika, 1914—15, x. p. 179; and Andereon, 
Biometrika, 1914 — 15, x. p. 269, where probable errors are given. 

t From Beatrice M. Cave and Karl Pearson, Biometrika, 1914 — 15, x. p. 340. 
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Year 


Savings 

X 


l^X 


From 
5 


Sq. 


Product 


Sq. 


From 
3 


1^, 


Tobacco 

y 


1885 


47 
















82 






6 


1 


1 


1 


1 


1 


4 




1886 


53 
















86 






2 


-3 


9 


3 


1 


-1 


2 




1887 


55 
















88 






2 


-3 


9 


15 


25 


-5 


-2 




1888 


57 
















86 








-4 


16 


12 


9 


-3 







1889 


58 
















86 








-4 


16 


8 


4 


-2 


1 




1890 


59 
















87 








-4 


16 


8 


4 


-2 


1 




1891 


60 
















88 








-1 


1 


2 


4 


-2 


1 




1892 


64 
















89 








-4 


16 


8 


4 


-2 


1 




1893 


65 
















90 









-5 


25 


20 


16 


-4 


-1 




1894 


65 
















89 






3 


-2 


4 


8 


16 


-4 


-1 




1895 


68 
















88 






2 


-3 


9 


9 


9 


-3 







1896 


70 
















88 






3 


-2 


4 


8 


16 


-4 


-1 




1897 


73 
















87 






2 


-3 


9 


3 


1 


-1 


2 




1898 


75 
















89 






4 


-1 


1 


1 


1 


-1 


2 




1899 


79 
















91 






4 


-1 


1 


2 


4 


-2 


1 




1900 


83 
















92 






4 


-1 


1 











3 




1901 


87 
















95 






5 











1 


-1 


2 




1902 


92 
















97 






7 


2 


4 


-2 


1 


-1 


2 




1903 


99 
















99 






8 


3 


9 











3 




1904 


107 
















102 






8 


3 


9 


3 


1 


1 


4 




1905 


115 
















106 






12 


7 


49 











3 




1906 


127 
















109 






17 


12 


144 


36 


9 


3 


6 




1907 


144 
















115 






11 


6 


36 


36 


36 


6 


9 




1908 


155 
















124 






13 


8 


64 


32 


16 


4 


7 




1909 


168 
















131 






12 


7 


49 


28 


16 


4 


7 




1910 


180 
















138 






7 


2 


4 


6 


9 


3 


6 




1911 


187 
















144 






5 











16 


4 


7 




1912 


192 
















151 






145 




606 


247 


220 




69 
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In the table the variate x is the Savings Index, y is the Tobacco 
Index. We then have 

Mean of JD^ = 145/27 = 5-37, 
Mean of J)y = 69/27 = 2-56. 

Take 5 and 3 as provisional centres, so that d^ = -37, 6,2= — '44. 
From deviations from these points we get, as shown in the table, 

S (^D/) = 506, 

S(iZ),iD,) = 247, 

S (iZ)„2) = 220. 

Therefore the correlation (correcting for dj^ and dg) is 

247 + 5 
*" V(506 - 4) -v/(220 - 5) 

Second differences have a small negative correlation, which increases 
till with sixth differences we reach — '431, which seems to indicate that, 
when time has been ehminated, expenditure on tobacco in any year 
means less money saved. 



CHAPTER VIII 

THE CORRECTION OF RAW CORRELATION COEFFICIENTS 

Historical account — ^The elimination of irrelevant factors — Reliability coefficients — 
Correction for observational errors. 

(1) HISTORICAL ACCOUNT 

The history of the use of the theory of correlation in Psychology can 
hardly be said to have begun earlier than the commencement of the 
present century. During the previous twenty years, indeed, a great deal 
of work had been done by many observers in measuring simple mental 
abihties (by the "mental test" method) in larger or smaller groups of 
subjects, and attempts had even been made to determine in what way 
these abilities were related to one another and to more general mental 
ability, or "general inteUigence." 

Owing, however, to a universal lack of knowledge of the mathe- 
matical theory of correlation among psychologists during this period, 
the results were not obtained in a form suitable for comparison with 
one another, so that it is not surprising to find that they hopelessly 
contradict one another. The heterogeneity of the material worked 
with, the non-eUmination of irrelevant factors, and the absence of any 
measure of the "probable error" of the results make the conclusions 
drawn by the investigators themselves from their researches utterly 
unreliable. 

The first investigation showing any mathematical precision was that 
pubUshed by Clark Wissler* in 1901. It contained (inter alia) an account 
of the careful application of a large number of simple mental tests upon 
over 200 college students, and a correlation of the results with one 
another and with the students' marks in the various subjects of the 
college curriculum. The mental tests were found to correlate but shghtly 
with one another or with abiUty in college subjects of study, though 
these latter showed considerable correlation with one another (-30 — 75). 

In the following year Aikens and Thorndikef pubhshed results which 

* Clark Wissler, "The Correlation of Mental and Physical Testa," Psychological 
Review, Monograph Supplement, in. No. 16, June 1901. 

f " Correlations among Perceptive and Associative Processes," PsychologidU Beview, ix. 
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were in a sense confirmatory of those of Wissler, since, notwithstanding 
the greater similarity to one another of the functions investigated than 
in Wissler's research, the correlations were again found to be low. For 
example, different tests devised for the measurement of "speed of 
association" were found to show hardly any correlation — a result which 
seemed to furnish some justification for the author's statement that 
" quickness of association as an abiUty determining the speed of all one's 
associations is a myth" (pp. cit. p. 375). A similar lack of close relation- 
ship was found in the case of other mental functions which would, on 
the evidence of introspection alone, be confidently classed as particular 
instances of the same general mental function. 

In 1904 there appeared an epoch-making article by Professor C. 
Spearman*, the ideas originating in which have, at least in England, 
ever since dominated correlational work in its apphcations to psychology. 
Since we shall have frequent occasion in the course of this and the 
succeeding chapter to take exception to Professor Spearman's theories 
and mathematical methods, which appear to us incorrect and harmful, 
we may perhaps be allowed at this point, before embarking on contro- 
versial matters, to express our opinion that only Professor Spearman's 
enthusiasm and originality could have given to psychological correlation 
research the life and activity which it has shown during the last fifteen 
years. His work has stirred up both disciples and opponents to investi- 
gations which would otherwise never have occurred to them. 

The new ideas in question fall into two main groups. 

(1) Corrections to the raw values of correlation coefficients. Instead of 
measuring large numbers of individuals, as his predecessors had done, 
Professor Spearman contented himself with small numbers, groups of 
less than 40 in his first research, and as few as 11 in the second; but he 
proposes to make up for the unrehabihty thus introduced into the 
results by a more careful measurement of his cases, and the apphcation 
of "corrections" to the "raw" values of his correlation coefficients by 
means of appropriate mathematical formulae. 

(2) The discovery of " hierarchical " order among correlation coeffi^nents, 
and the Theory of General AbiUty, or the Theory of Two Factors, which 
has been built up on this foundation. This theory has been a great incentive 
to research, and may possibly correspond to the facts, though we do 
not incline to think so. But its deduction from the occurrence of 
"hierarchical" order among the correlation coefficients is invahd, as 

• "General Intelligence objectively determined and measured," Amer. Journ. of 
Psychol. XV. pp. 201—292. 
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will be shown in the next chapter. In the present chapter we turn to 
the closer consideration of the first group of ideas, the correction of raw 
correlation coefficients. 



(2) THE ELIMINATION OF IRRELEVANT FACTORS 

The first kind of correction is that for the eUmination of irrelevant 
factors, and is nothing new in the theory of correlation, being simply 
the method of partial correlation described in the last chapter. 

For example, to eUminate the effects of difference of age in the group 
experimented upon, one would determine the partial correlation between 
the two characters under consideration, for "age constant," by means 
of the Yulean formula given on p. 137. A similar procedure is needed 
for ehminating the effects of difference of sex, etc. A preferable course, 
however, would be to dispense as far as possible with the necessity for 
such corrections by selecting groups of individuals of the same age, 
sex, etc. Indeed, as Professor Spearman very properly points out*, 
the partial correlation formula must on no account be used in cases 
where there is too violent heterogeneity of the irrelevant factor. For 
example, we might with justice use it to ehminate the effects of age in 
a group where the extreme differences of age were only over a range of 
two or three years, but not in a group where the subjects ranged from 
say five years to fifteen years of age. 

(3) RELIABILITY COEFFICIENTS 

The second kind of correction introduced by Professor Spearman is 
a correction for what he calls "accidental" errors. This correction is 
based on the "rehabihty coefficients" introduced by Professor Spearman, 
and as these rehabihty coefficients are in themselves a very useful con- 
ception quite apart from their employment in this connection, we take 
this opportunity of explaining them. 

Anyone who has carried out psychological experiments, or even an 
ordinary examination, on the same subjects with similar tests on two 
or more different occasions does not need to be reminded that the results 
will differ, sometimes very decidedly. The order of merit of the children 
in a test to-day will not be the same as their order of merit in a closely 
similar test to-morrow. Unless however the differences are only shght, 
it is clear that the test or examination is of no practical use. Its re- 

* "Demonstration of Formulae for the True Measurement of Correlation,'' Amer. 
J (mm. of Psychol. 1907, xvm. especially p. 166. 
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liability can be conveniently measured by the correlation coefficient of 
the marks obtained on the two different occasions. Such a correlation 
coefficient is called a reUabiUty coefficient. In practice tests which give 
rehability coefficients lower than 0*6 are useless, and the ideal would 
be a great deal higher than this. 

(4) CORRECTION FOR OBSERVATIONAL ERRORS 

It is by the aid of these reUabihty coefficients, as we have said, that 
Professor Spearman carries out the calculations which have for their 
object the ehmination of observational errors. 

It is clear that if the correlation between two series of quantities is 
really perfect, then any observational errors made in measuring these 
quantities can only, and will actually, reduce the correlation. The 
amount of this reduction will probably be greater, the greater are the 
observational errors. The size of these is however indicated to some 
extent by the reliability coefficients, so that it is but a short step to use 
these to correct for the reduction, and enlarge the correlation coefficient 
to its true value. An example given by Professor Spearman himself* 
shows this so clearly that it is not out of place to repeat it here. 

"A target was constructed of a great many horizontal bands, 
numbered from top to bottom. Then a man shot successively at a 
particular series of numbers in a particular order. Clearly, the better 
the shot, the less numerical difference between any number hit and that 
aimed at; now, just as the measurement of any object is quite appro- 
priately termed a ' shot ' at its real value, so, conversely, we may perfectly 
well consider the series of numbers actually hit in the hght of a series 
of measurements of the numbers aimed at. When the same man again 
fired at the same series, he thereby obtained a new and independent 
series of measurements of the same objects. Next, a woman had the 
same number of shots at some set numbers in a similar manner. If, 
then, our above reasoning and formulas (see below) are correct, it should 
be possible, by observing the numbers hit and working out their corre- 
lations, to ascertain the exact resemblance between the series aimed at 
by the man and the woman respectively. In actual fact, the series of 
numbers hit by the man turned out to correlate with those hit by the 
woman to the extent of 0-52; but it was noticed that the man's sets 
correlated with one another to 0-74, and the woman's sets with one 
another to 0-36; hence the true correspondence between the set aimed 

* Amer. Journ. of Psychol. 1904, xv. p. 271. 
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at by the man and that aimed at by the woman was not the raw 0-52, but 

0-52 
*" ^ V(0-74 X 0-36) ^ ^'^^' 
that is to say, the two persons had fired at exactly the same series of 
bands, which was really the case." 

It will be seen that the formula for correction is "divide the raw 
correlation by the geometrical mean of the two reHabiUty coefficients." 
This formula, or rather the more accurate formula of which it is an 
approximate form, we shall prove presently. Meanwhile some general 
remarks on it may be made. Clearly, the formula will always increase 
the raw correlation, supposing, as is in practice usually the case, that 
the correlations concerned are all positive. As we have seen, it can at 
once be admitted that a perfect correlation will of course be reduced by 
observational errors, so that the raw correlation wiU need an increase. 
Similarly, it may on common-sense grounds be allowed that even when 
the true correlation is not perfect but only very high, the same will be 
true, but with less and less certainty as the correlation is really less and 
less perfect. 

In the example given, the true correlation was really perfect, and 
therefore it is clear that any upward correction will be an improvement, 
provided that it does not go past unity. But suppose we take a case where 
the true correlation is not unity, but some less amount. The following 
is an actual trial. Again two persons fired at a target, a man and a 
woman. The correlation between the series of numbers aimed at by 
the man and that aimed at by the woman, was 0-74. The measured 
correlation however between the series really struck by the man and that 
really struck by the woman was 0-65. The rehabihty coefficients were, 
for the man, 0-69 and for the woman, 0-79. The corrected value of the 
measured correlation is therefore 

_ 0-65 _ 

'-V(0-69x0-79)-^^^' 

which overshoots the mark, being 0-14 greater than the true value, 
whereas the uncorrected value was only 0-09 too small. 

The fact is, that the assumptions underlying Professor Spearman's for- 
mula are never fulfilled in practice. What these assumptions are can best 
be seen from the elegant proof given for the formula by Mr Udny Yule*. 

* See Appendix (e) to Professor Spearman's article, Brit. Journ. of Psychol. 1910, m. 
p. 294. There is a printer's mistake of omission in the last formula. See also W. Brown, 
"Some Experimental Results in Correlation," Comptes Bendus du F7"" Congrea Inter- 
national de Psyckologie, Genfeve, 1910, where the same proof is quoted. 
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Xi and 2/1 are measures of x and y at a certain series of measurements, 
X2 and ^2 S'l^ measures of a; and y at another series of measurements. 
Let Xj^= x+ 81, X2 = x + §2, 

Vi^y + ^v y2 = y + «2. 

all terms denoting deviations from means. 

Then, if it is assumed that 8, e, the errors of measurement, are 
uncorrelated with one another or with x or y, 

S {x8) etc. = 0, 
S (xjy^) = S (xy). 

Hence '■a!i»i*^i5i°'i' 

and similarly ''^xiVi'^x^^^v; 



4 _ tTjCi gga gyi gy, 

or Txy ^xiVi^x^yi^XiVi^Xiyi „ 4_ 4 

"x *^y 

But also, since S (x8) = 0, S (XjX^) = S (x^), 
and rxixi<^xi<^xi = <^x 

or fr^^o^^ = - — and Oy^Oy^ = - — , 



or 



'*» ^ ,2 «.2 > 

Geom. Mean of correlation coefficients 



'^ Geom. Mean of reliability coefficients 
It is this formula which is employed in the example given above of 
shooting at a target, the numerator there however consisting only of 
^(a!i+a:8)(»i+»2)) ^^^ subcalculatious for r^^y^ etc. not being made. 

Attention must be drawn to the assumption that the errors of 
measurement 8 and e are uncorrelated with each other or with x or y. 
^;y Now, these are very large assumptions to make. Even in cases where 
the quantities 8, e are genuine errors of measurement, there are strong 
reasons for assuming (on general principles and also from experimental 
evidence)* that they will be correlated. But in the case of almost all 
the simpler mental tests the quantities 8 and e are not errors of measure- 
ment at all. They are the deviations of the particular performances from 
the hypothetical average performance of the several individuals under 
consideration. Thus they represent the variability of performance of 

* See Karl Feaison, "On the Mathematical Theory of Errors of Judgment, with 
special reference to the Personal Equation," Phil. Trans, oxoviu. A, pp. 236 — 299. 
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function within the individual. When an individual in the course of 
three minutes succeeds in striking through 100 e's and r's in a page of 
print on one day, and 94 under the same conditions a fortnight later, 
there is no error of observation involved. The numbers 100 and 94 are 
the actual true measures of abiUty on the two occasions. The average 
or mean abihty, which is the more interesting measure for the purposes 
of correlation, is doubtless different from either, but that does not make 
the other two measures erroneous. Evidently in these cases S and e 
represent individual variability, and to assume them uncorrected with 
one another or with the mean values of the functions is to indulge in 
somewhat a priori reasoning. 

There are two comparatively simple ways of testing the assumption : 

(1) S {x^,} = S (xy) = S (x^^), 

.'. S (Xi^/i) — S (Xgj/a) should = within the Umits of the probable 
error of the difference. 

Brown has apphed this test to the case of correlation between 
accuracy in bisecting lines and accuracy in trisecting them in 43 adult 
subjects. 

Here S {bjtj) - S {hj,^) = 137780 - 60036 

= 77744, 

p.E. of S {xy) = -67449 ^'MnJPi^V^^ i^ Pearson's notation, 

•67449 f S {xyf S (x^) S (y^) 
~ ^/n '\ n n^ ' 

p.E. of S (6i<i) = 687 ; p.e. of S (b^t^) = 365 ; 

.-. p.E. of S (6i<i) - S (b^t^) = V6872 + 3652 

= 778. 

Since 778 is less than one-third of 77744, the formula cannot be 

employed to obtain the correlation between mean abihties in bisecting 

and trisecting hues. 

S {(a?! - x^) (yi - y^)} 
(2) r^,-|, - ^g (^^ _ ^^^,s (y^ _ y^), 

= 0, 
if errors are uncorrelated with one another (since numerator then = 0). 
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Applying this test to the same case of bisection and trisection, 
Brown gets 

»-B,-B,= 0-30 ±0-09, 

which proves once more the inapplicability of the formula. 

Brown applied test (2) also to the case of correlation between speed 
of addition of figures and accuracy of addition in a group of 38 school- 
children (girls between the ages of 11 and 12) and found 

rs,-s, = 0-35 ± 0-09. 

Even when test (2) does give the value 0, we can only conclude from 
this that 

S (S,,,) + S (8,62) = S (S^e,) + S (S^ei). 

Replying to these and other criticisms of his formula for eUminating 
observational errors, Professor Spearman* admits that many forms of 
error will be correlated with each other and with the true values of the 
quantities measured. But these, he says, are generally of a continuously 
progressive nature, and he proposes to eliminate their influence by 
making at least three measurements of x, and taking the first and third 
of these together as the x^ of the formula, the middle one as x^; or to 
use more complicated but essentially similar devices. No doubb, of 
course, this is a wise precaution to take, even though sceptics may still 
doubt whether the remaining so-called "accidental" variations are even 
yet uncorrelated with each other and with x and y. 

Brownf, using experimental data obtained by two independent 
observers estimating the lengths of Unes, found a considerable corre- 
lation ratio between the errors of observation and the true lengths of 
the lines, the regression being very far from Unear. He is of opinion 
that "no assumptions as to the correlation or non-correlation of such 
deviations are in the least justified." 

The correlation table between lengths and errors, and the corre- 
sponding regression curves, are shown in the following table and dia- 
gramj. 

In a review of Brown's article Mr J. R. Wilton§ has made some 
ingenious suggestions for weighting the different x's in a manner sug- 
gested by quadrature formulae. He remarks also that a grouping 
should be sought which would as far as possible satisfy the assumptions. 

* Brit. Joum. Psychol. 1910, m. p. 271. t Brit. Joum. Psychol. 1913, vi. p. 223. 

X Prom Brit. Joum. Psychol. 1913, vt. pp. 236 — 8. 
§ Joum. of Exp. Pedag. 1914, n. p. 302. 
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It may finally be pointed out that even if the errors S and e were 
known with certainty to ie uncorrelated with each other and with the true 
values, yet, with such small numbers of cases as are used in many of the 
psychological researches in which Professor Spearman's formula has been 
employed, the chance of, e.g. S (x^i^j) being nearly equal to S (x^^ is 
exceedingly small, and it is difficult to attach any meaning to an artificial, 
post hoc separation of the data into halves such that this condition is 
satisfied (and also others). The formula is at any rate inapphcable to 
samples such as 30, 24, 52, which have been freely used in experimental 
work; or if used at all, it can only be as a guide to the sufficiency of the 
sample. It is an essential of good work to use such samples that correc- 
tions to the raw values obtained are unimportant. 

Regression Curves of Correlation Table* 



Errors made, in mm. 

-g1 -19-17 -16 ,-13 -11 r-ft ^7 
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Fig. 27 
* See pp. 162—3. 
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CHAPTER IX 

THE THEORY OF GENERAL ABILITY 

This apparent unity is illusory. Man, in fact, is a microcosm as complex as the 
world which is mirrored in his mind; he is a federation incompletely centralised, a 
hierarchy of numerous and conflicting passions, ecuih of which has ends of its own, and 
each of which, separately considered, wouM give a different law of conduct. He is in 
some sense a unit, hut his unity is such as to include an indefinite immher of partly 
independent sensibilities. 

Leslie Stephbit, The Science of Ethics, p. 69. 

Discovery of "Hieraxchical" order among correlation coefficients — Use of the 
formula for the correction of observational errors to prove the existence of a 
general factor — Researches between 1904 and 1912 — ^A criterion for hierarchical 
order applied to numerous researches — Complications in the original theory. 

(1) DISCOVERY OP HIERARCHICAL ORDER AMONG CORRELATION 

COEFFICIENTS* 

The controversy as to whether abiUty in any individual is general, or 
specific, or in groups or "faculties" is a very old one, but for the purposes 
of the present chapter it is not necessary to go back prior to 1904, in 
which year there was pubUshed the first f of a series of articles in which 
Professor C. Spearman has developed his Theory of General Ability, or 
Theory of Two Factors, as it is alternatively named. 

Professor Spearman's method in that paper was to measure a number 
of mental abilities, some of them school subjects, others artificial tests, 
in a number of persons, and calculate the correlation coefficients of each 
of these activities with each of the others. These correlation coefficients, 
he then noticed, had a certain relationship among themselves, a relation- 
ship which may be called hierarchical order, and is explained in detail 
later. He saw, quite rightly, that the presence of a general factor would 
produce this hierarchical order among the coefficients, and, reversing 
this argument, he concluded that the presence of hierarchical order 
proved the existence of a general factor. 

In this first series of investigations Professor Spearman used the 
following groups of subjects: 24 village school-children of both sexes, 

* Sections 1, 3 and 5 of this chapter are largely extracts from an article by 6. H. 
Thomson in the Psychological Review, 1920, xxvn. p. 173. 

•f "General Intelligence objectively determined and measured," C. Spearman, Amer. 
Jmurn. Psychol. 1904, xv. pp. 201—293. 
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age limits 10-0 to 13-10; 23 boys of a high class preparatory school, 
age limits 9-5— 13-7; and 27 adults of both sexes, age Umits 21—78. 
The tests employed were those for pitch discrimination, weight dis- 
crimination, and discrimination of hght intensities; and measures of 
intelligence were obtained, in the case of the children, from results of 
school examinations, grading by teachers, and grading of one another 
by the children themselves (measure of common sense). 

The various school subjects in the preparatory school were found to 
correlate highly with one another, and when, with the inclusion of pitch 
discrimination and music, they were arranged in rows and columns, it 
was found possible to place them in such an order that the correlation 
coefficients formed a hierarchy, each being (with very few exceptions) 
greater than any to the right of it in the same row, or below it in the 
same column, thus 





Classics 


French 


English 


Math. 


Discrim. 


Music 


Classics ... 


— 


0-83 


0-78 


0-70 


0-66 


0-63 


French ... 


0-83 


— . 


0-67 


0-67 


0-65 


057 


English ... 


0-78 


0-67 


— 


0-64 


0-54 


0-51 


Math. ... 


0-70 


0-67 


0-64 


— 


0-45 


0-51 


Disorim.... 


0-66 


0-65 


0-54 


0-45 


— 


0-40 


Music 


0-63 


0-57 


051 


0-51 


0-40 


— 



This fact of "hierarchical" order which he had thus discovered was 
taken by Professor Spearman to indicate the presence of some common 
fundamental function which saturates in different degrees the different 
activities, and is the sole cause of correlation between them except in 
the case of very similar activities. 

It can easily be shown that if all the correlations are due solely to 
one common or general factor, then the correlation coefficients will be 
in perfect hierarchical order*. 

Let a and 'p be two mental tests or other activities, and g be the 
general factor. Then r^^.g is the correlation that a would have with p 
for constant g, and equals 



But if g is the sole source of correlation, r^xt.g must be zero, i.e. 

Similarly r^j, = rj,gr.j,g. 

Hence V = *«? and similarly = '^. 

r^g rjjp rjjQ 

* Hart and Spearman, Brit. Jaum. Psychol. 1912, v. p. 58 quoting Yule. Previous 
though less satisfactory proofs had also been given by Spearman. 
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A little consideration of this last equation shows that, if it be true 
for any four of the tests, it implies the possibiUty of arranging the corre- 
lation coefficients in the order we have termed " hierarchical " : and more 
than this, that the values of r in any one column of the "hierarchy" 
will bear a constant ratio, each to each, to their partners in any other 
column. 

Since clearly perfect hierarchical order cannot be expected in any 
experimental research, it becomes important to know what deviation 
from perfection can be allowed without giving up the idea of a general 
factor: or on the other hand, what approach to perfection can be attained 
without the presence of a general factor. These questions will occupy 
us presently. Meanwhile we turn for a while to another form of argument 
used by Professor Spearman. 



(2) USE OF THE FORMULA FOB THE CORRECTION OF OBSERVATIONAL 
ERRORS TO PROVE THE EXISTENCE OP A GENERAL FACTOR 

He considered that by the use of his formula for the correction of 
observational errors he could demonstrate the same thing (the existence 
of a " central function "), and could in particular show " that the common 
and essential element in the intelUgences wholly coincides with the 
common and essential element in the sensory functions*." The method 
of proof is as follows : 

Let ajj, x^ be two distinct measures of sensory discrimination, and 
yij Vi two distinct measures of intelligence. 

Then the correlation of the function common to the functions 
measured by x^, ccg with the function common to the functions measured 
^7 yi) 2/2 is equal to 

*» ^xii/i '^xiVj ^jUjifi ^(Cgga 

and if the two functions referred to are identical this expression should 
be equal to unity. 

In the present article. Spearman uses a simplified formula, 

V r^jXi f Villi 

and puts for the numerator the average of the various correlations 

* Amer. Jmm. Paychol. 1904, xv. p. 269. 
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evaluated between the intelligences and the discriminations, and in the 
denominator puts 

''itixz = the average correlation of the intellective gradings with one 
another, 

'"»i»2 = the average correlation of the gradings in discrimination with 
one another, 
and in this way gets results approximately equal to 1 in the different 
groups tested. 

One or two remarks may appropriately be made here. In the first 
place, the full formula is the only one that can be used with any meaning 
or justice since it is the only one which issues logically from the mathe- 
matical proof. In the second place, the apphcabihty of the true formula 
must be considered in the Ught of its presuppositions (mentioned above, 
p. 158). 

Indeed, the assumption that 8 and e are uncorrelated with each other 
or with X OT y seems even more unwarrantable here than in the case of 
"correcting" coefficients, for which the formula was originally devised. 

(3) RESEARCHES BETWEEN 1904 AND 1912 
A number of experimental researches on these hnes, in some of 
which Professor Spearman himself took part, were carried out during 
the eight years following 1904, but with very conflicting results, some 
experimenters finding the hierarchical order among the coefficients, 
others finding no such order. Two articles of this period, for example, 
are those of Mr Cyril Burt*, who found practically perfect hierarchical 
order, and Dr WiUiam Brownf , who found small trace of such order. 
A similar conflict of opinion was found with regard to the alternative 
method of attack, as for example in the research by Messrs Thorndike, 
Lay and DeanJ. The subjects examined were 37 young women students 
and 25 high school boys. The tests for sensory discrimination were: 

(1) drawing hnes equal to given Unes, and 

(2) filhng boxes with shot to equal in weight standard weights; 
those for intelligence were: 

(3) judgment of fellow-students, and 

(4) judgment of teachers. 

* Cyril Burt, "Experimental Tests of General Intelligence," Brit. Joura. Psychol. 
1909, m. pp. 94—177. 

•(• William Brown, "Some Experimental Results in the Conelation of Mental Abilities," 
Brit. Joum. Psychol 1910, m. pp. 29&— 322. 

t Thorndike, Lay and Dean, "The Relation of Accuracy in Sensory Discrimination 
to General Intelligence," Amer. Jonrn. PsycJujl. July 1909, xx. pp. 364 — 369. 
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For the high school boys (3) and (4) were combined teachers' and 
fellow-students' judgments and school marks, respectively. 

In the first case, Spearman's formula gave for the correlation of the 
factor common to (1) and (2) with that common to (3) and (4) the value 
0-26 instead of 1-00. In the second case, the value was 0-29. Moreover, 
Thorndike found a much higher correlation between discrimination of 
lengths and discrimination of weights than between either one of them 
and general intelUgence, the coeflBlcients being 

Accuracy in drawing lines, intelligence O-IS, 

Accuracy in making up weights, intelligence ... 0-25, 
Accuracy in drawing hnes and making up weights 0-50. 

Thus the results were in decided conflict with both parts of Spearman's 
concluding statement "that all branches of intellectual activity have 
in common one fundamental function (or group of functions) whereas 
the remaining or specific elements of the activity seem in every case 
to be wholly different from that in all the others*." 

Thorndike sums up as follows: "In general there is evidence of a 
complex set of bonds between the psychological equivalents of both 
what we call the formal side of thought and what we call its content, so 
that one is almost tempted to replace Spearman's statement by the 
equally extravagant one that there is nothing whatever common to all 
mental functions, or to any part of themf." 

Things were in this very unsatisfactory state when an important 
article by Professor Spearman, in cooperation with Dr Bernard Hart, 
appeared in 1912J. In this article the difficulty of making an unbiassed 
judgment as to the presence or absence of hierarchical order was recog- 
nised, and a form of calculation was given for obtaining a numerical 
criterion of the degree of perfection of hierarchical order, which criterion 
would be independent of any bias on the part of the calculator and would, 
it was hoped, give the true amount of hierarchical order, corrected for 
the samphng errors of experiment. This criterion ranges theoretically 
from zero, for absence of hierarchical order, to unity, for perfection 
of hierarchical order. But their formula can, arithmetically, exceed 
unity. 

* C. Spearman, ATner. Journ. Psychol, xv. p. 284. 
t Op. cit. p. 368. 

t "General Ability, its Existence and Nature," by B. Hart and C. Spearman, Brit. 
Journ. Psychol. 1912, v. pp. 51 — 84. 
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(4) A CRITERION FOR HIERARCHICAL ORDER 

The underlying idea was that if the above square table * of correlation 
coefficients shows hierarchical order in any degree, there will be corre- 
lation between the columns of that table taken in pairs, and that when 
the hierarchical order is perfect the columnar correlation R will rise to 
unity, except in so far as it is blurred by the sampling errors, which 
obviously cannot increase an already perfect correlation, but can only 
decrease it. Let us write dashed letters throughout for the true values 
of the various quantities, which in ordinary experiment are unknown, 
reserving undashed letters for their measured values. We then have : 

*■' = true correlation coefficient, 
e = its samphng error on one occasion, so that 

^ = r' + e, 

r' = mean of the column of true values r', 
r = mean of the column of observed values r. 

In finding these means, that coefficient is omitted which has no partner 
in the column with which correlation is being found. Write also 

p' = r' measured from the mean of the true column, i.e. 

= r' — r', and similarly 
p = r measured from the mean of the observed column, i.e. 

= r — f, 
e=p-p',=e-e, 
where e is the mean of the column of e's. 

Then for two columns a and h, the true columnar correlation which 
we desire to know is 

p /_ " \Pxa. Pxb ) /1\ 

•* V{S{pJ')S(p^'^)} ^'■>' 

by the Bravais-Pearson product-moment formula, S indicating summa- 
tion over the various values of x, i.e. summation up the column. This 
can be written 

^ ' ^ S iPmPxb) — S (exqe^) — S (p^'e^) — S (pro'e-a) 

V{S(PxaPxa) — S{ea„e^) — 2S(p^' e^)}'\/{S(p^p^) — S{eta,exb) — 2^(Pa*'€a»)} * 

In this expression, the three quantities of the form S (pp) are known. 
The three quantities of the form S (ee) are not known, but an attempt 
can be made to estimate their probable values from the known standard 
deviations of the correlation, coefficients. The four quantities of the 

* p. 165. 
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form S (p'e) are treated by Dr Hart and Professor Spearman, in their 
paper, as negligible, on the ground that p' will not in general be correlated 
with e. This assumption, as we shall presently see, was erroneous. 

The formula at which Dr Hart and Professor Spearman eventually 
arrive, after neglecting these quantities and making various other 
assumptions, is 

jf , ^ S ipxaPnib) — (W — 1) rg^Oxgf'xb ^^^^ ,2) 

■* V{S ip.a^) -{n-1) a~^} V{S (pj) - (n - 1) a~2} >' 

where the ct's are standard deviations of the correlation coefficients, the 
bar indicates mean values for the column, and n is the number of 
pairs of correlation coefficients concerned, in the two columns. In using 
their formula, its authors do not apply it to all the pairs of columns 
in the square table. They say: "In any case the correction must be 
kept within Umits : as usual, the larger the correction the less it is to be 
trusted. If the samphng errors are large enough, they eventually will 
quite swamp the true differences of magnitude upon which the observed 
correlation should be based. In this case, the true correlation is beyond 
ascertainment; any attempt at correction is merely illusory. To avoid 
this, and at the same time to ensure impartial treatment of all data, 
it is necessary to fix beforehand some definite hmit to the feasibility of 
correction. We have here adopted the following standard: in order to 
attempt to estimate the correct correlation between columns, it is 
required that in each of these columns the mean square deviation should be 
at least double the correction to he applied to that deviation." 

That is to say, the equation (2) is not to be used unless, in each factor 
of the denominator, S (p^) is at least double its correction (w — 1) a^. 
This condition (the "correctional standard") will be found to be im- 
portant. 

The authors applied their criterion to all the experimental work 
available, work dating from various periods, and representing the re- 
searches of 14 experimenters on 1463 men, women, boys and girls. 
From beginning to end the values of the criterion were positive and very 
high. The mean was almost complete unity. That is to say, Dr Hart 
and Professor Spearman claimed that all the data then available showed 
perfect hierarchical order among the correlation coefficients, even the 
data of workers like Dr Brown and Professor Thorndike, who had been 
unable to detect any such order. The reasons why the hierarchical order 
among the correlation coefficients was not obvious at a glance were, 
according to these authors, two. In the first place, their theory did not 
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entirely deny the presence of Group Factors of narrow range, and tests 
whicli were too similar were, according to them, to be pooled, before the 
hierarchical order would become apparent. Onlyin veryf ew cases however 
did they find it necessary to pool tests in the data used. In the second 
place, the obscuring of the perfect hierarchical order was, according to 
them, due to the fact that only a small sample of subjects is examined. For 
this error allowance is made in the formula for calculating their criterion. 

Dr Hart and Professor Spearman therefore considered their " Theory 
of Two Factors " proved. This theory considers abihty in any activity 
to be due to two factors. One of these is a General Factor, common to 
all performances. The other is a Specific Factor, unique to that par- 
ticular performance, or at any rate extending only over a very narrow 
range including only other very similar performances. "It is not 
asserted," they say, "that the General Factor prevails exclusively in 
the case of performances too alike, but only that when this hkeness is 
diminished, or when the resembling performances are pooled together, 
a point is soon reached where the correlations are still of considerable 
magnitude, but now indicate no common factor except the General one." 

In the same paper Dr Hart and Professor Spearman consider, and in 
their opinion confute, two other theories, (a) the older view of Professor 
Thorndike, viz. a general independence of all correlations, and (b) Pro- 
fessor Thorndike's newer view of "levels," or the almost universal belief 
in " types." If the former were true, their criterion would, they consider, 
show an average value of about zero : if the latter, a low minus value. 

Their argument runs as follows. 

If none but quite Specific Factors are present, the correlations will 
all be zero, and the pairs of columns will show no correlation with one 
another. If however correlations exist, but are due to Group Factors 
alone, then tests which share a Group Factor will correlate highly, but 
others will not correlate at all. Let there be three such Group Factors; 
then we shall obtain not a hierarchy but an arrangement hke this: 
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h=iagh. correlation. l=low correlation. See Brit. Journ. Psychol. 1912, v. p. 57. 
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in which the high correlations are concentrated along the diagonal. In 
this arrangement some columns will correlate positively, namely those 
in which the high correlations come opposite one another; but these 
will be in the minority and most pairs of columns will correlate negatively. 
Professor Spearman and Dr Hart conclude therefore that in the absence 
of a General Factor the average correlation between columns will be 
either zero or negative, and that only a General Factor will give a very 
high positive correlation between pairs of columns. 

In this consideration of Group Factors however it has been tacitly 
assumed that there is no overlapping of such factors. If this were so 
then indeed a hierarchy would be impossible. But it is at any rate a 
conceivable hypothesis that such overlapping should occur, that for 
example there might exist a factor common to three tests a, h, c and 
another common to c, d, e, so that c contains both factors : and on this 
hypothesis an excellent hierarchy can be obtained without any General 
Factor, and the average column correlation can even approach unity, 
as we shall show presently. 

(5) COMPLICATIONS IN THE ORIGINAL THEORY 

Many experimental researches were inspired by this paper of Dr Hart 
and Professor Spearman, of which, as a good example, may be cited 
one in 1913 by Mr Stanley Wyatt*. It is not too much to say that 
in practically all of these the application of the Hart and Spearman 
criterion gave values closely approximating to unity and therefore sup- 
porting the Theory of General AbiUty. But compUcations began to 
arise, of which the first of importance will be found in Dr Edward Webb's 
monograph on "Character and IntelUgence," in 1915f. Dr Webb con- 
sidered that he had found (in addition to Professor Spearman's General 
AbiUty) a second general factor, which he calls "persistence of motives." 
Other writers began to find that their data required for their explanation 
large Group Factors, of wider range than those contemplated in the 
original form of Professor Spearman's theoryj. Quite recently Mr J. C. 
Maxwell Garnett, discussing the data of a number of workers with the 
aid of mathematical devices which he has introduced for the purpose, 

* Stanley Wyatt, "The Qasintitative Investigation of Higher Mental Processes," 
Brit. Joum. Psychol. 1913, vi. pp. 109—133. 

t E. Webb, "Character and Intelligence," Brit. Joum. Psychol, Monog. Supplement, 
1915, No. 3, pp. ix and 99. 

i See especially N. Carey, "Factors in the Mental Processes of School Children," 
Brit. Joum. Psychol. 1916, vm. pp. 170 — 182. 
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concludes that in addition to the single general factor of Professor 
Spearman, there are two large Group Factors which are practically 
general* (one of them being indeed almost identical with Dr "Webb's 
second general factor), which he calls respectively "Cleverness" and 
"Purpose," both distinct from General Abihty. 

It is clear therefore that in any case the simple original form of 
Professor Spearman's theory is becoming comphcated by additions 
which tend to modify it very considerably. Meanwhile, however, one 
of us had come definitely to the conclusion that the mathematical 
foundations upon which it was based were in fact incorrect. Before 
developing the Une of argument which led to this, it will be well to re- 
state Professor Spearman's case in its simplest terms in a few words. 

It is entirely based upon the observation and measurement of hierarchical 
order among correlation coefficients. It states that after allowance has been 
made for sampling errors this hierarchical order is found practically in 
perfection. And, it finally states that such a high degree of perfection can 
only be produced by a General Factor, and the absence of Group Factors, 
which would mar the perfection. 

* J. C. Maxwell Gamett, "General Ability, Cleverness, and Purpose," Brit. Journ. 
Psychol. 1919, rx. pp. 345—366. 



CHAPTER X* 

A SAMPLING THEORY OF ABILITY 

The case against the vaUdity of Professor Spearman's argument — ^Hierarchical 
order produced by random overlap of group factors, without any general factor — 
Application of the "criterion" to these cases, apparently proving the presence of 
a general factor — ^The erroneous nature of the "criterion" — ^Hierarchical order the 
natural order among correlation coefficients — ^A sampling theory of ability — ^Transfer 
of training — Conclusions. 

(1) THE CASE AGAINST THE VALIDITY OF PROFESSOR SPEARMAN'S 

ARGUMENT 

As we tave already seen in previous chapters, it is possible, by means 
of dice throws or in other ways, to make artificial experiments on 
correlation, with the immense advantage that the machinery producing 
the correlation is known, and that therefore conclusions based upon the 
correlation coefficients can be confronted with the facts. Working on 
these Unes, one of usf made, in 1914, a set of imitation "mental tests" 
(really dice throws of a comphcated kind), which were known to contain 
no General Factor. The correlations were produced by a number of 
Group Factors which were of wide range, and, unhke Professor Spearman's 
Specific or Narrow Group Factors, they were not mutually exclusive. 

These imitation mental tests, containing no General Factor, gave 
however a set of correlation coefficients in excellent hierarchical order, 
and the criterion was when calculated found to be unity, so that had 
these correlation coefficients been pubhshed as the result of experimental 
work, they would have been claimed by Professor Spearman as proving 
the presence of a General Factor. In a short reply Professor Spearman 
laid stress on the fact that this arrangement of Group Factors which 
thus produced practically perfect hierarchical order was not a random 
arrangement, that it was exceedingly improbable that this one special 
arrangement should have occurred in each of the psychological researches 
of many experimenters, so improbable indeed as to be ruled entirely 
out of court J, and that a random arrangement of Group Factors, though 

* Much of this chapter, and part of the preceding, consists of extracts from "General 
versus Group Factors in Mental Activities," by G. H. Thomson, Psychol. Review, 1920, 
xxvn. p. 173. 

t Godfrey H. Thomson, "A Hierarchy without a Greneral Factor," Brit. Joum. 
Psychol. 1916, vm. pp. 271—281. 

I C. Spearman, "Some Coimuents on Mr Thomson's Paper," Brit. Joum. Psychol. 
1916, vm. p. 282. 
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it might give some hierarchical order, would not give it in the perfection 
actually found. The obvious way to find out if this is so or not is to try it, 
with artificial " mental tests " formed of dice throws. This was done in 
November and December of 1918, after an unavoidable delay of some 
years. Sets of artificial variables (analogous to the scores in mental 
tests) were made, in each of which the arrangement of Group Factors 
was decided by the chance draws of cards from a pack*. It was found 
that hierarchical order resulted, which when measured by the " criterion " 
appeared to be perfect. 



(2) HIERARCHICAL ORDER PRODUCED BY RANDOM OVERLAP OE 
GROUP FACTORS, WITHOUT ANY GENERAL FACTORf 

Write down the letters x-^, x^, x^, ... as the names of the variates to 
be formed, and prepare columns to receive the numbers of group factors 
and specific factors in each variate. Determine each number by the 
draw of a card from an ordinary playing pack, returning the card and 
shuflfling between each draw : the knave, queen, and king may be counted 
11, 12, and 13 respectively. The result of one such set of drawings is 
shown in this table : 



Variate 


Group factors 


Specific factors 


Total 


'Ki 


5 


5 


10 


Xi 


5 


3 


8 


"a 


12 


12 


24 


Kj 


1 


3 


4 


»6 


7 


6 


13 


X> 


9 


5 


14 


X, 


13 


13 


26 


"a 


1 


2 


3 


X, 


9 


3 


12 


«10 


11 


5 


16 



Proceed next to identify the group factors of each variate. Do this 
by using a single suit of the pack. After shuffling it well, lay out the 
top five cards to represent the five group factors in x^, and note them. 

* Godfrey H. Thomson, "On the Cause of Hierarchical Order among the Correlation 
Coefficients of a Number of Variates taken in Pairs," Proceedings of the Royal Society of 
London, 1919, xov. A, pp. 400—408. See also, by the same author, "The ffierarchy of 
Abilities," and "The Proof or Disproof of the Existence of General AbiUty," in Brit. 
Journ. Psychol. 1919, ix. pp. 321—344. 

t This section and also section 5 consists largely of extracts from the Proc. Boy. Soc. 
1919, xov. A, pp. 400—408. 
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After replacing them and reshuffling, do the same for x^, and so on, as 
in this table: 
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The next step is to note the number of factors common to each pair of 
variates, as in this table: 





«! 


Xt 


x» 


Xi 


X5 


Xe 
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Xe 


X, 


Xio 


Xl 
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— 
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1 
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X, 


5 


2 


8 





5 


5 


9 
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8 
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4 


10 
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6 
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a 




8 


— 



From these, and from the total number of factors both specific and group 
in each variate, can be found the correlation which would occur between 
the variates were we to throw dice, one to each factor, and repeat the 
throwings a large number of times. The formula is 

Number of common factors 
Geometrical mean of totals' 

This formula is appUcable not only to variates formed by the addition 
of dice, but to variates which are any function of the factors or elements, 
provided that the form of the function is the same in each variate, and 
that the standard deviation is the same for each element or factor*. 
We thus obtain the following table of theoretical correlation coeffi- 
cients: 



* 6. H. Thomson, loc. dt. p. 275. It can readily be deduced from Bravais, Mimoires 
de Vlnatitut de France, 1846, ix. eqn. 28. 
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Ki 


a^a 


»a 


Xt 


a^e 


x» 


X, 


Xg 


*■» 


i^lO 


Totals 


Xi 





0-22 


0-32 


0-00 


0-44 


0-17 


0-31 


018 


0-46 


0-32 


2-42 


3=2 


0-22 


— 


0-39 


0-18 


0-29 


0-47 


0-35 


0-00 


0-20 


0-35 


2-45 


a^3 


0-32 


0-39 


— 


010 


0-39 


0-44 


0-48 


0-12 


0-62 


0-51 


3-37 


»4 


000 


0-18 


0-10 


— 


014 


013 


0-10 


000 


000 


012 


0-77 


^i 


0-44 


0-29 


0-39 


0-14 


— 


0-30 


0-38 


0-16 


0-40 


0-41 


2-91 


Xi 


0-17 


0-47 


0-44 


013 


0-30 


— 


0-47 


000 


0-38 


0-53 


2-89 


K, 


0-31 


0-35 


0-48 


0-10 


0-38 


0-47 


— 


0-11 


0-51 


0-54 


3-25 


»^8 


0-18 


0-00 


012 


000 


0-16 


0-00 


0-11 


— 


0-17 


015 


0-89 


X, 


0-46 


0-20 


0-62 


0-00 


0-40 


0-38 


0-51 


017 


— 


0-58 


3-32 


a^io 


0-32 


0-35 


0-51 


012 


0-41 


0-53 


0-54 


0-15 


0-58 


— 


351 



If we wished to obtain experimental values of these, dice would have 
to be thrown, one to each factor. The die corresponding to the group 
factor called "Ace" would have its score counted into every variate 
containing the group factor in question. The dice representing the 
specific factors would, of course, only be counted into the one variate 
in which they occur. 

The last column of the preceding table gives the total correlation of 
each variate with all the others, found by adding the rows of the square 
table. Rearrange now the sequence of the variates in the order of 
magnitude of these totals*, and we obtain the following table: 





Xw 


3=3 


X, 


X, 


a^s 


Xt 


x^ 


»i 


^8 


Xt 


*10 





0-51 


0-58 


0-54 


0-41 


0-53 


0-35 


0-32 


0-15 


0-12 


*» 


0-51 


— 


0-62 


0-48 


0-39 


0-44 


0-39 


0-32 


012 


0-10 


Xg 


0-58 


0-62 


— 


0-51 


0-40 


0-38 


0-20 


0-46 


0-17 


0-00 


X, 


0-54 


0-48 


0-51 


— 


0-38 


0-47 


0-35 


0-31 


0-11 


0-10 


»6 


0-41 


0-39 


0-40 


0-38 


— 


0-30 


0-29 


0'44 


016 


0-14 


X, 


0-53 


0-44 


0-38 


0-47 


0-30 


— 


0-47 


017 


0-00 


0-13 


Xi 


0'35 


0-39 


0-20 


0-35 


0-29 


0-47 


— - 


0-22 


0-00 


0-18 


Xi 


0-32 


0-32 


0-46 


0-31 


0-44 


0-17 


0-22 


. — 


0-18 


0-00 


Xg 


0-15 


012 


0-17 


0-11 


0-16 


000 


0-00 


0-18 


— 


000 


Xi 


0-12 


0-10 


000 


010 


0-14 


013 


0-18 


0-00 


000 


— 



Here the tendency to hierarchical order is quite noticeable. This par- 
ticular example is purposely chosen from among a number calculated, 
as being that which shows the least hierarchical tendency. Even here 
however there is clearly a general lowering of the coefficients as we 
pass either along a row or down a column. The columnar correlation 
is high for the first few columns. For the columns headed a^Q and Xg it is 
0"97, and as far down as the columns headed x^ and Xg it is still 0-65. If 
these theoretical numbers were blurred by experimental error, they might 
well be claimed as having come from a perfect hierarchy by the criteria 
in vogue. Other hierarchies chosen at random from those formed in 
the above manner, show still more perfect hierarchical order, and some- 
* A convenient p.an, but of no theoretical significance. 
B. & T. 12 
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times the hierarchical order is almost quite perfect, as in ibhe example 
given in the British Journal of Psychology, 1919, ix. p. 343. 



(3) APPLICATION OF THE "CRITERION" TO THESE CASES, 
APPARENTLY PROVING THE PRESENCE OF A GENERAL FACTOR 

The values of the correlation coefficients given in the above table 
are of course the real values. To obtain experimental values of these, 
dice were thrown, one die to each Group or Specific Factor, and the 
whole repeated 20 times, analogous to experiments on 20 subjects*. 

From the dice scores the observed correlations between the variates 
can be calculated, just as the correlations between mental tests are 
calculated. Using the product-moment formula we obtain the set of 
values in this table, arranged in hierarchical order, only slightly different 
from the true hierarchical order, except that variate x^ has changed 
its position rather violently. 

The Observed Hierarchy 





a^io 


"h 


x» 


a^s 


x^ 


«6 


*i 


''i 


2^4 


a!* 


^10 





•72 


■47 


•64 


•53 


•50 


•34 


•45 


•21 


•09 


■^ 


•72 


— 


•48 


•43 


•75 


•48 


•32 


■67 


-•26 


•10 


■■'« 


•47 


•48 


— 


•51 


■46 


•45 


•50 


•46 


-•02 


•24 


*s 


•64 


•43 


•51 


— 


•58 


•60 


•20 


•15 


•29 


•08 


x^ 


•53 


•75 


•46 


•68 


— 


•63 


•26 


•33 


•05 


-11 


*6 


•50 


•48 


•45 


•60 


•63 


— 


•22 


•29 


-16 


•18 


^1 


•34 


•32 


•50 


•20 


•26 


•22 


— 


•41 


•38 


•15 


^S 


•45 


•67 


•46 


•15 


•33 


•29 


•41 


— 


-•20 


•08 


aJj 


•21 


-•26 


-02 


•29 


•05 


-•16 


•38 


-•20 





-11 


a^s 


•09 


■10 


•24 


•08 


-11 


•18 


•15 


•08 


-•11 


— 



The pairs of columns which pass the Hart and Spearman correctional 
standard give the following values: 



Columns passing 
standard 


Observed columnar 
correlation E 


True columnar 
correlation 


The Hart and Spearman 

corrected columnar 

correlation R' 


2 and 7 
6 „ 7 
2 „ 3 

2 „ 6 

3 „ 6 


073 
0-63 
0^70 
081 
066 


075 
0-89 
060 
0^88 
0^83 


076 
1-15 
101 
106 
104 


Means 


071 


0-79 


100 



True mean columnar correlation of the whole table and not merely ) 
of the pairs of columns selected by the correctional standard \ 

* 6. H. Thomson, Biometrika, 1919, xn. pp. 355 — 366, where the full details of the 
dice throws are given. Sections 3 and 4 of the present chapter consist largely of extracts 
from Biometrika, where the diagrams 28 and 29 appeared. 
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Dr Hart and Professor Spearman would therefore claim the hierarchy 
as being a sample of a perfect one. The true mean columnar correlation 
for the whole table is 0-59, the Hart and Spearman correctional standard 
selects pairs of columns whose true mean columnar correlation is 0-79, 
and the mean value of these when corrected according to their formula 
rises to unity. This example goes far towards shaking confidence in 
their criterion. 



(4) THE ERRONEOUS NATURE OP THE HART AND 
SPEARMAN CRITERION 

The inaccurate and exaggerated estimates of hierarchical order which 
are given by this "criterion" arise chiefly from two causes, (1) the 
erroneous assumption that p and e are uncorrelated (see p. 170), and 
(2) the action of the "correctional standard." We shall consider these 
in turn. 

Consider the formula for the standard deviation of a correlation 
coefficient, viz. 

_ l-r^ 
'''~ V^ ' 

where iV is the number in the sample. It follows from this that the 
larger correlation coefficients will probably have the smaller sampUng 
errors e, disregarding the sign of e for the moment. 

But these signs of the quantities e are not likely to be indiscriminately 
positive and negative. On the contrary, they will have a tendency to 
be either all positive or all negative, if, as is the case in most of the 
columns of coefficients considered by Professor Spearman, the corre- 
lations in the square table are mainly positive. The errors in the corre- 
lation of a variate ccj^ with a variate a are themselves correlated with 
the errors in the correlation of the variate a with another variate x^, 
according to the formula* 

W.. - 2(1- r^J) (1 - f^2) 

That is, the correlation of the samphng errors of r^^^ '^^ the sampling 
errors of r^^ depends chiefly upon t^^^. To illustrate, let us take three 

* Karl Pearson and L. N. G. Filon, " On the Probable Errors of Frequency Constanta," 
Fhxl. Trans, of the Royal Soc. 1898, cxci. A, eqia. 37. 
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correlations from an experiment in psychology, carried out by Mr Wyatt*. 
If we let 

Xj^ be the mental test "Rearranged Letters," 



"Missing Digits," 
"Analogies," 



the values there found were 



rx,a = 0-63, 
r^„ = 0-61. 



Then by the above formula the correlation of the errors of these two 
coefficients depends chiefly upon r^^^, whose measured value is 0-63. 
Using the full formula, and employing the measured values in default 
of the true ones, the correlation between r^^^ ^^^ ^x^a turns out to be '47. 
It is therefore (to an extent indicated by this value) probable that they 
are either both too large or both too small. The same argument holds, 
in varying degrees, for the other correlations all over Mr Wyatt's table, 
which are all positive. They all have a tendency to be either all too 
large or all too small: in other words, the e's tend to be all of the same 
sign. The relationship between the correlation coefficients of a column, 
and their errors, can therefore be summed up in the following table, 
in which the symbol | e \ denotes the magnitude of e regardless of sign. 



r' 


|e| 


p' 


e or c 


p'e or p'e 


large 

V 

small 


small 

A 

large 


+ 
+ 

+ 


+ + + 1 1 1 1 

1 1 1 + + + + 


1 1 1 + 1 1 1 
+ + + 1 + + + 








S{p'e) = 


- or + 



The first column shows the true correlations r' arranged in order of 
magnitude. The second column expresses the fact that the samphng 
errors on any occasion will probably be arranged in the reverse order of 
magnitude, disregarding their signs. The third column shows the corre- 
lation coefficients measured from their mean. The upper p"a are then 
positive, and the lower negative, and also, what is not shown in the 

* Stanley Wyatt, "The Quantitative Investigation of Higher Mental Processes," 
Brit. Journ. Psychol. 1913, vi. p. 131. 
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table, the absolute values increase upwards and downwards from the 

point where the signs change. The fourth (double) column shows the 

probable arrangement of the signs of the quantities e. If the e's are all 

tending to be positive, then the left-hand 

member of the double column gives the 

arrangement, while if the e's all tend to be 

negative, the other member of the double 

column does so. As shown in the last (double) 

column, therefore, the quantities p'e tend 

either to be nearly all negative or nearly all 

positive. For a very small sample the signs 

of p'e will no doubt be quite irregularly 

arranged. But with such a small sample, 

even if p and e were really uncorrelated, it 

would be most unhkely for S (p'e) to be 

negUgible. As the sample increases the signs 

tend to settle down to the above arrange- 
ment, and S (p'e) does not tend to disappear 

compared with S (ee), but only to take on 

one or other of alternative values. It will 

only be zero when all the errors are zero, 
i.e. when no corrections are needed to R'. 
The distribution of S (p'e) about zero in a 
number of samples of the same size will not, 
that is, show a maximum at zero, but a 
minimum, as is shown quahtatively in Fig. 28. 
If, in fact, the actual value of S (p'e) is 
calculated in cases where the true correla- 
tions are known, it is frequently found to be 
greater than the quantities S (ee) which are 
left in the expression. 

The other approximations made in obtain- 
ing the criterion do not appear to be so 
erroneous as this one, though their cumulative 
effect may explain some anomalies. Leaving 
them on one side let us consider the "cor- 
rectional standard " required by Dr Hart and 
Professor Spearman before they admit any pair of columns. It is this 
correctional standard, combined with the pecuhar distribution of R', 
which chiefly is responsible for the exaggeration of perfection produced 




Xansnbajj-^ 



Passed by the correctional standard 
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by this criterion, and for the regularity with which an average value of 
unity is arrived at. 

Let us examine first the actual distribution of the Hart and Spearman 
R' in a psychological hierarchy, viz. that of Wyatt already referred to, 
and calculate R' not only for those columns which pass the correctional 
standard, but also for other pairs of columns. What we find is that its 
value rises as we descend the hierarchy, rushing asymptotically to 
infinity, remaining for a time imaginary, and then returning. Specimen 
values from Mr Wyatt's hierarchy are given. 

Pairs of columns Values of the Hart and Spearman Criterion 

Analogies and Wordboilding 0-93 

Completion and Wordbnilding 0-97 

Completion and Part-wholes 1-05 

Wordbuilding and Part-wholes 0-99 

Part-wholes and Memory (delayed) ... 0-92 . 

Rearranged letters and Missing digits ... 1-17 

Wordbnilding and E R Test 1-26 

Sentence construction and Fables 1-33 

Rearranged letters and E R Test Practically infinity 

Nonsense syllables and dissected pictures ... Imaginary 

Crossline test and Letter Squares 0-35, a meamn^fess value, both factors in the 

denominator being now negative 

Expressed in diagrammatic form this and similar calculations lead 
to the conclusion that in actual practice the criterion is distributed as 
in Fig. 29, where the curve is to be understood as a " best fitting " curve 
among the values of R' scattered, with a very considerable dispersion, 
on both sides of it. The line, in fact, ought to be a broad smudge. 

Now clearly, with a distribution of this sort, it is very important 
that the boundary between the values that are to be rejected and those 
that are to be accepted should be chosen with the greatest care, and not 
arbitrarily but scientifically. Either sound theoretical reasons should 
be given for the choice of the correctional standard, or the choice should 
be based empirically on experiments in material where the truth is 
known a priori, as in the above dice experiments. For obviously, by 
moving this boundary, we can make the final average take on almost 
any value. Another point is that the criterion rushes to infinity at such 
speed that its probable error must be enormous. Dr Hart and Professor 
Spearman, however, give no reasons for their choice of this particular 
standard, upon which depends so much the values they obtain. The 
standard which they thus arbitrarily adopt begins admitting the criteria 
at just such a distance above unity as to balance the cases which give a 
criterion below unity, and entirely explains the remarkable unanimity with 
which this average value unity is obtained by them in their calculations. 
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In otlier words, the remarkable regularity with which this criterion 
gives the value unity is not a property of the investigated correlation 
coefficients at all, but is a property possessed by the criterion itseK, due 
to errors and the action of the "correctional standard." 
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Descending the hierarchy— 
Kg. 29 
In the writers' opinion the work outUned in this chapter finally 
proves the invalidity of Professor Spearman's mathematical argument 
in favour of the Theory of Two Factors. If this be so that theory returns 
to the status of a possible, but unproven, theory. 

(5) HIERARCHICAL ORDER THE NATURAL ORDER AMONG 
CORRELATION COEFFICIENTS 
The fact is that hierarchical order, which Professor Spearman was 
the first to notice among correlation coefficients, is the natural relation- 
ship among these coefficients, on any theory whatever of the cause of 
the correlations, excepting only theories specially designed to prevent 



184 CORRELATION [pt. n 

its occurrence. It is the absence of hierarchical order which would be 
a remarkable phenomenon requiring special explanation; its presence 
requires none beyond what is termed chance. 

An analogy from the simple repeated measurements of a Unear 
magnitude may help to illustrate this. Indeed it is rather more than an 
analogy, being in fact the same phenomenon in its simplest terms and 
dimensions. It is well known that many measurements of the same 
quantity, made with all scientific precautions, under apparently the 
same conditions, and with an avoidance of all known sources of error, 
nevertheless do not give a number of identical values. The values are 
all different, but are not without law and order in their arrangement. 
They are grouped about a centre from which the density decreases in 
both directions, and it is found that this grouping is for most practical 
purposes closely represented by the Normal or Gaussian Curve of Error 
(or one of the more general Pearsonian Curves). 

Experimenters are not surprised to find their data obeying the 
ProbabiUty Law, nor do they require a special theory to explain it. 
On the contrary, it is the departures from this Law which if wide would 
require special investigation, and if confirmed would require a special 
theory. In the same way hierarchical order among correlation coeffi- 
cients should not cause surprise, though any marked variation from this 
order would demand investigation. 

Correlation coefficients are themselves correlated, and n correlation 
coefficients form an w-fold or w-dimensional correlation-surface. The 
particular and convenient form of tabulation of correlation coefficients 
adopted by Professor Spearman and followed by most other psycho- 
logical workers brings to hght, in the form of " hierarchical order, " one 
of the properties of this correlation-surface of the correlations. 

In an article entitled "On the Probable Errors of Frequency Con- 
stants and on the Influence of Random Selection on Variation and 
Correlation," in the Phil. Trans. 1898, cxci. A, pp. 229 — 311, Professor 
Pearson and Mr Filon give the following formulae: 

«r„r„-»-23 2 (I - r,,^) (1 - T,,^) 

f(»'l3 - '■l2'"23) (»'24 - »'23»"34) + i^li " '■l3»'s4) (^ " '"21 '"is) 1 
_ 1+ (^13 - ^14^43) ('"24 - 1'21^u) + i^li - '•lZ>'24) ('"23 - '■24»"4s)> 



2 (1 - r,,^) (1 - r^^) 



so that, as they say, "errors in the correlations of a first organ with a 
second and a third have a correlation themselves of the first order," 
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and "errors in the correlation of two organs and in the correlation of 
a second two have only correlation of the second order." 

Suppose now that the correlations among a number of variates taken 
in pairs are really all the same, and positive, and in a sample let the 
observed value of rgg be the highest observed value : 



Xi x^ 

3=2 


X, 

h 
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Xi x^ 


Xt \ x^ 


Xg 

h 
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X, . . . 


Xg h h 
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h h 
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B, 


h . . . 
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h 
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Xa 


H^ 
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B, 
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h 




Xg h h 


B, 


h h 


flai h 
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h . . . 


*10 


h 






h 





£r,=: highest correlation. £f2=^B<'°'^d highest, and so on. A = tendency to be high. 

Then, because of the above theorems of Pearson and Filon, the rows 
and columns x^ and x^ will probably contain more total correlation than 
do the others, and the second highest correlation will probably be in 
one of these. Let it be rgg. Then, after the rows x^ and Xg the row x^ 
will probably contain many high correlations and rgg will probably be 
the third highest coefficient, because it is a node where two ridges of 
high correlation cross. If it is, then the hierarchy so far is excellent, as 
can be seen on rearranging the square table so as to bring x^, Xg, and x^ 
to the head*. 

Where now is the fourth highest r likely to be found? Somewhere 
among those marked h for high, and slightly more probably among the 
r's of the row x^. If to avoid further rearranging we take it to be rj^, 
then rgj^ is most probably the fifth, and r^j the sixth, because they are 
nodes. And so on. In fact it is clear that when the r's are really equal 

* See next page. 



186 COEEELATION [pt. ii 

to one another, then sampling the population will give a set of observed 
r's which are arranged not in haphazard, but in hierarchical order. 
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In the experiment described above with cards and dice, where 
hierarchical tendency was found to be produced by group and specific 
factors without any general factor, there was, however, no question of 
sampling the population. The hierarchical order already appears in the 
theoretical coefficients. There is here, however, another kind of sampUng 
present, viz. sampUng of the elements which make up the variates. 

Let us suppose, instead of deciding the numbers of groups and specific 
elements as we did by drawing cards, we had dipped into an infinite bag 
containing black balls and white balls, the former representing group 
factors and the latter specific factors. Then the most probable event 
would have been that the proportion of group factors and specific 
factors in each variate would have been the same as the proportion in 
which the balls occurred in the bag. If, in addition, we assume the samples 
drawn to be all the same size, then the most probable result of the whole 
experiment would have been obtaining all the correlations equal. 

That they do not come out equal may be regarded as due to the 
sampling. The samples vary in size, the proportion of group factors 
varies from sample to sample, and the distribution of the individual 
group factors among the several variates departs from the most probable 
distribution. From this point of view the departure of the correlation 
coefficients from equahty is due to errors of sampling, and the apphca- 
tion of the theorem of Pearson and Filon would lead us to expect what 
as we have seen does actually occur, namely, hierarchical order. 
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The experiment with the bag of balls would as a matter of fact not 
produce as great a departure from equality of correlation coefficients 
as IS found in practice in experimental psychology, or as was found in 
our form of the experiment with card drawing. This in the latter case 
IS because the cards give a greater variation of the proportion of group 
to specific factors than would be found with the bag of balls. And this 
is also the case in mental tests, which are not chance samples of the 
mental elements, but are carefully chosen so as to measure different 
kinds of activity. 

This apphcation of Pearson and Filon's theorem (which contemplates 
only samphng errors in the ordinary sense) to the changes in correlation 
produced by samphng the underlying elements, is no doubt somewhat 
novel, and may appear to be a difficulty. 

Further consideration leads to the following resolution of the 
difficulty. 

Suppose that n variates (in our work the scores in mental tests) 
are so connected by factors that the correlations are all equal and 
positive. Then let a small sample of the population be taken. The 
observed correlations will show departures from equahty, and will be 
found to be in hierarchical order. This hierarchical order is due to 
samphng the population. 

Now consider why the correlations do not come out at their true 
values. They give of course the true values /or the sample. The reason 
of their departing from the true values of the whole population is that 
(a) some of the factors which really are links between the variates (the 
mental activities) happen to have remained steadier than usual during 
the sample. In the Umdt a factor might happen to retain exactly the 
same value through the various individuals of the sample. That is, 
some of the hnking factors do not in reahty come into action, or not in 
their full force; (6) on the other hand, some factors which are really 
different and unconnected may happen by chance to rise and fall 
together, through the sample, and more or less to act as one. That is, 
fictitious hnking factors are created, which would disappear with a 
larger sample. 

Clearly therefore a hierarchy of correlation coefficients, caused by 
samphng the population, is due to chance having caused a change in 
the apparent factors acting. It follows that if we make a real change 
in the factors acting, we shall get a hierarchy, and this is what we do 
when we choose the mental tests to be employed in any research. Each 
mental test is a test of a sample of abihties. 
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The laws governing the correlation of correlation coefficients which 
vary because of sampling the population can, in fact, be applied without 
hesitation to the relationships between "true" correlations in the whole 
of any population simply because any such population is itself a sample. 
Enghsh grammar school boys of 12 are themselves a sample of a larger 
boyhood; the whole human race indeed is a sample of " what might have 
been," selected by the struggle for survival. 

The whole question clearly has philosophical bearings on the degree 
of reahty of causal connections; for on this view those chance hnks in 
a small sample which were a few paragraphs ago termed "fictitious links, 
which would disappear with a larger sample," do not differ except in 
degree from the "real" causal Unks which we only term real because 
they persist throughout the largest sample with which we are acquainted. 

In another direction there are connections with the difference, which 
is one of degree only, between what is called "partial" correlation and 
"entire" correlation*. 

The conclusion to be drawn is that hierarchical order is the natural 
order to expect among correlation coefficients, on a theory of chance 
sampUng alone, and that therefore, by the principle of Occam's razor, its 
presence cannot be made the criterion of the existence of any special form 
of causal connection, such as is assumed in the Theory of Two Factors. 

(6) THOMSON'S SAMPLING THEORY OF ABILITYt 
In place therefore of the two factors of that theory, one General 
and the other Specific, Thomson prefers to think of a number of factors 
at play in the carrying out of any activity such as a mental test, these 
factors being a sample of all those which the individual has at his com- 
mand. 

The first reason for preferring this theory is that of Occam's razor. 
It makes fewer assumptions than does the more special form of theory. 
It does not deny General AbiUty, for if the samples are large there will 
of course be factors common to all activities. On the other hand it does 
not assert General Abihty, for the samples may not be so large as this, 
and no single factor may occur in every activity. If moreover a number 
of factors do run through the whole gamut of activities, forming a 
General Factor, this group need not be the same in every individual. 

* See Karl Pearson, "On the Influence of Natural Selection on the Variability and 
Correlation of Organs," Phil. Trans. Roy. Soc. London, 1902, oc. A, pp. 1 — 66; Godfrey 
fl. Thomson, "The Proof or Disproof of the Existence of General Ability," Brit. Journ. 
Psychol. 1919, ix. pp. 321—336. 

t Psychol. Review, 1920, xxvn. p. 183. 
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In other words General Ability, if possessed by any individual, need 
not be psychologically of the same nature as any General Abihty 
possessed by another individual. Everyone has probably known men 
who were good all round, but Jones may be a good all round man for 
different reasons from those which make Smith good all round. 

The Samphng Theory, then, neither denies nor asserts General 
Abihty, though it says it is unproven. Nor does it deny Specific Factors. 
On the other hand it does deny the absence of Group Factors. It is this 
absence of Group Factors which is in truth the crux of Professor 
Spearman's theory, which is not so much a theory of general abihty, 
or a theory of two factors, as a Theory of the Absence of Group Factors. 
And inasmuch as its own disciples have begun to require Group Factors 
to explain their data, its distinguishing mark would appear in any case 
to be disappearing. 

Such Group Factors as are admitted by Professor Spearman are of 
very narrow range, and are mutually exclusive, that is they do not 
overlap. Both these points follow from the sentence used in the 1912 
article with Dr Hart, where it is said that, in the case of performances 
too ahke, "when this Ukeness is diminished, or when the resembhng 
performances are pooled together, a point is soon reached where the 
correlations are still of considerable magnitude, but. now indicate no 
common factor except the General one." 

Since this point is soon reached, the Group Factors must be narrow 
in range. Since poohng a few performances will obhterate any Group 
Factors, they must be exclusive of one another. For if A, B, C and Z> 
are four tests, in which A and B have a Group Factor common to them, 
and C and D another, then of course by pooUng A with B and also C 
with D we can obtain two pools AB and CD which have no hnk. But 
it A, B and C have one Group Factor, and C and D have another, then 
these Group Factors cannot be separated into Specific Factors. In fact, 
a Specific Factor is a separated Group Factor, and Professor Spearman's 
theory asserts that Group Factors, if any, are separable and mutually 
exclusive. This is a great stumbhng-block in the way of the acceptance 
of the Theory of Two Factors, unless perhaps "Specific Factor" is 
interpreted in the way suggested later. 

It is a fact which will be admitted by most that the same activity 
is not performed in the same way by different individuals, even though 
they are equally expert. Not only are Specific Factors therefore required 
by this theory for every separate activity, excluding only any which 
are very closely similar, but also Specific Factors of different psycho- 
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logical natures are required for each individual. Further, the same 
individual does not always perform the same actiAdty in the same way. 
A man using an ergograph will, as he tires, begin to employ muscles 
other than those naturally used at the outset. When we are returning 
from a cycle ride muscles are used in a different manner from the style 
adopted at the start, indeed sometimes deUberate changes are made to 
give relief. And in the same way a mental task is performed by different 
methods at different times. Does this then mean a different Specific 
Factor for each way of doing a task? All these difficulties appear to 
argue against the Theory of Two Factors, and seem to be considerably 
cleared up by the SampUng Theory. 

Finally, the Sampling Theory appears to be in accordance with a 
line of thought which has already proved fruitful in other sciences. 
Any individual is, on the Mendehan theory, a sample of unit quaUties 
derived from his parents, and of these a further sample is apparent 
and expUcit in the individual, the balance being dormant but capable 
of contributing to the sample which is to form his child. It seems a 
natural step further to look upon any activity carried out by this indi- 
vidual as involving a further sample of these quahties. 

(7) THE DIFFICULTY OP "TRANSFER OP TRAINING" 

Although Professor Spearman's Theory of Two Factors has been 
chiefly based by him on the Hne of argument which, it is suggested, 
has now been proved invalid, viz. the "hierarchy" argument, yet there 
is another and powerful form of reasoning which can be brought to 
its support, based upon the fact that, according to some experimenters, 
improvement in any activity due to training does not transfer in any 
appreciable amount to any other activity, except to those very similar 
indeed to the trained activity. And even those workers who do not 
agree that this is an experimental fact are usually content to take a 
defensive attitude and say that transfer is not disproved. Few if any 
will say that it is proved. 

This certainly seems to point to the absence of Group Factors, and 
to support Professor Spearman's theory, which only needs to add to 
itself the assumption that the Specific Factors are, while the General 
Factor is not, capable of being improved by training, to fit the case 
admirably. Of course, if transfer really occurs, the argument proves 
the opposite. And although psychological experiment points on the 
whole to the absence or the narrowness of transfer, yet popular opinion 
among business men, schoolmasters, and others, is in favour of transfer 
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to a considerable extent. Assuming no transfer, however, how can the 
Sampling Theory, with its numerous Group Factors, explain this? 

It is necessary to assume that the Group Factors are all unim- 
provable or only shghtly improvable by training, though they may 
change with the growth and development of the individual. The 
improvement which certainly takes place when we practise any activity 
is due, it may then be assumed, not to improvement in the elemental 
abiUties which form the sample, but to a weeding out, and selection of 
these. The sample alters, mainly no doubt is diminished, though addi- 
tions are also conceivable. It becomes a more economical sample, and 
waste of effort in using elements which are unnecessary is avoided. 
Improvement in any mental activity may on this view be compared 
with improvement in a manual dexterity, in which it is notorious that 
the improvement consists largely in the avoidance of unnecessary 
movements. 

When another activity is then attempted, the elemental factors are 
just the same as they would have been had the practice in the first 
activity not taken place. The new activity will be performed by a new 
group of factors, which sample will as in the first case be in the beginning 
wasteful and will include many unnecessary elements. Transfer of 
improvement gained in the first activity will therefore not take place 
except in so far as the second activity is recognised as a mere variant of 
the original one, in which case the weeding out process which has taken 
place in the first case may be done at the very first attempt, at any rate 
to some extent. 

To use another analogy, the improvement which takes place when 
a football team practises playing together for a series of matches is due 
more to team work than to individual improvement. A new team, even 
though it contain a large proportion of players from the first team, will not 
have this unity of action. There will be httle transfer of improvement. 

According to the view here developed, it is the weeding out of the 
sample of elemental abiUties which is specific. The team work is specific, 
though the players play for several clubs. This would appear to enable 
a reconcihation to be effected between the almost universal behef in 
"types" of abiUty (to which Professor Spearman refers) and the experi- 
mental facts concerning both correlation and transfer. If there be a 
General Factor at all, it might be the power to shake down rapidly into 
good team work, in a word, educability. But there seems no objection 
to assuming that this, instead of being a General Factor, is a property 
of each elemental factor, varying from factor to factor. 
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To sum up this section : if transfer of training really does not occur 
to any great extent, then it has to be admitted that the Theory of Two 
Factors readily explains this. But the Sampling Theory can also do so, 
in a manner which is perhaps not so easy to set forth, but which never- 
theless appears to be more illuminating and less artificial than the 
alternative theory. 

(8) CONCLUSIONS 

Professor Spearman's Theory of Two Factors, which assumes that 
abihty in any performance is due to (a) a General Factor and (b) a Specific 
Factor (Group Factors being absent, or at any rate very narrow in 
range and mutually exclusive), is based chiefly on the observed fact that 
correlation coefficients in psychological tests tend to fall into "hierar- 
chical order." It has been shown, however, that the criterion adopted 
for evaluating the degree of perfection of hierarchical order present is 
untrustworthy and has led to over-estimation. Such hierarchical order 
as is actually present is in fact the natural thing to expect, and it is the 
absence of such which should occasion surprise. The proof of the Theory 
of Two Factors which is based on the presence of hierarchical order 
therefore falls to the ground. The theory remains a possible explanation 
of the facts but ceases to be the unique explanation. As an alternative 
theory Thomson has advanced a Sampling Theory of AbiUty, in which 
any performance is considered as being carried out by a sample of 
Group Factors. This theory is preferred because it makes fewer and less 
special assumptions, because it is more elastic and wider, and because 
it is in closer accord with theories in use in biology and in the study of 
heredity*. 

* For articles on the General Ability controversy which have appeared while this 
book was going through the press the reader is referred to the conclusion of the Biblio- 
graphy, p. 212. 
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TABLES 

1. Fechner's Fundamental Table. 

2. Urban's Tables for the Constant Process. 

3. Table of Muller-Urban Weights. 

4. Eeciprocals of pq, where p + q= 1. 

5. Eich's Checking Table for the Constant Process. 

1. Fechner's Fundamental Table. 



p 


7 


P 


7 


P 


7 


0-50 


00000 


0-67 


0-3111 


0-84 


0-7031 


0-51 


00177 


0-68 


0-3307 


0-85 


0-7329 


0-52 


00355 


0-69 


0-3506 


0-86 


0-7639 


0-53 


00532 


0-70 


0-3708 


0-87 


0-7965 


0-64 


00710 


0-71 


0-3913 


0-88 


0-8308 


0-55 


0-0888 


0-72 


0-4121 


0-89 


0-8673 


0-56 


01067 


0-73 


0-4333 


0-90 


0-9062 


0-57 


0-1247 


0-74 


0-4549 


0-91 


0-9480 


0-58 


01427 


0-75 


0-4769 


0-92 


0-9935 


0-59 


0-1609 


0-76 


0-4994 


0-93 


1-0435 


0-60 


0-1792 


0-77 


0-5224 


0-94 


1-0993 


0-61 


0-1976 


0-78 


0-5460 


0-95 


1-1630 


0-62 


0-2160 


0-79 


0-5702 


0-96 


1-2380 


0-63 


0-2346 


0-80 


0-6951 


0-97 


1-3300 


0-64 


0-2535 


0-81 


0-6208 


0-98 


1-4520 


0-65 


0-2725 


0-82 


0-6473 


0-99 


1-6450 


0-66 


0-2916 


0-83 


0-6747 


I -00 


00 



If J? is < 0-5, look in the Table not for p but for I — p, and take 
y negative. Thus the y f or ^ = 0-25 is - 0-4769. 

This Table is less generally useful than Sheppard's Tables of the 
Probability Integral, Tables I and II of Pearson's Tables for Statisticians 
and Biometrimans (Cambridge University Press). Table I there difEers 
from this only by a factor '\/2 : but gives many more values and to more 
decimal places. 
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2. Urban' s Tables for the Constant Process. 
Prom Archivf. d. gea. Psychol. 1912, xxiv. 240—241*. 



p 


W 


yW 


2 W 


2'W 


2yW 


3W 


S^W 


3yW 


iW 


42 If 


iyW 


0-50 


1-0000 


0-0000 


2-0000 


4-0000 


0-0000 


3-0000 


9-0000 


0-0000 


4-0000 


16-0000 


0-0000 


0-51 


0-9998 


0-0177 


1-9996 


3-9991 


00354 


2-9993 


8-9980 


0-0631 


3-9991 


15-9965 


0-0708 


0-52 


0-9991 


0-0355 


1-9982 


3-9963 


0-0709 


2-9972 


8-9917 


0-1064 


3-9963 


15-9853 


0-1419 


0-53 


0-9980 


0-0531 


1-9959 


3-9918 


0-1062 


2-9938 


8-9816 


0-1693 


3-9918 


15-9672 


0-2124 


0-54 


0-9964 


0-0707 


1-9928 


3-9855 


0-1415 


2-9891 


8-9674 


0-2122 


3-9865 


16-9421 


0-2830 


0-55 


0-9943 


0-0883 


1-9886 


3-9772 


0-1766 


2-9829 


8-9487 


0-2649 


3-9772 


15-9088 


0-3532 


0-56 


0-9918 


01058 


1-9836 


3-9671 


0-2116 


2-9753 


8-9260 


0-3175 


3-9671 


15-8685 


0-4233 


0-57 


0-9888 


0-1233 


1-9776 


3-9551 


0-2466 


2-9663 


8-8990 


0-3699 


3-9561 


15-8205 


0-4932 


0S8 


0-9853 


0-1406 


1-9706 


3-9413 


0-2812 


2-9560 


8-8679 


0-4218 


3-9413 


15-7651 


0-6624 


0-59 


0-9814 


0-1579 


1-9627 


3-9254 


0-3158 


2-9441 


8-8322 


0-4737 


3-9254 


15-7018 


0-6316 


0-60 


0-9768 


0-1750 


1-9537 


3-9074 


0-3601 


2-9306 


8-7916 


0-6252 


3-9074 


16-6296 


0-7002 


0-61 


0-9720 


0-1920 


1-9440 


3-8881 


0-3839 


2-9161 


8-7482 


0-5759 


3-8881 


16-5523 


0-7679 


0-62 


0-9666 


0-2088 


1-9332 


3-8663 


0-4176 


2-8997 


8-6992 


0-6263 


3-8663 


15-4663 


0-8351 


0-63 


0-9607 


0-2254 


1-9214 


3-8429 


0-4508 


2-8822 


8-6465 


0-6762 


3-8429 


15-3715 


0-9015 


0-64 


0-9542 


0-2419 


1-9084 


3-8168 


0-4838 


2-8626 


8-5878 


0-7257 


3-8168 


15-2672 


0-9676 


0-65 


0-9473 


0-2581 


1-8946 


3-7890 


0-5163 


2-8418 


8-6263 


0-7744 


3-7890 


15-1562 


1-0325 


0-66 


0-9398 


0-2741 


1-8797 


3-7594 


0-6481 


2-8196 


8-4586 


0-8222 


3-7594 


160376 


1-0962 


0-67 


0-9317 


0-2899 


1-8634 


3-7268 


0-5797 


2-7951 


8-3853 


0-8696 


3-7268 


14-9072 


1-1594 


0-68 


0-9232 


0-3053 


1-8464 


3-6929 


0-6106 


2-7697 


8-3090 


0-9159 


3-6929 


14-7715 


1-2212 


0-69 


0-9140 


0-3205 


1-8280 


3-6661 


0-6409 


2-7421 


8-2262 


0-9614 


3-6561 


14-6243 


1-2818 


0-70 


0-9043 


0-3353 


1-8085 


3-6170 


0-6706 


2-7128 


8-1383 


1-0059 


3-6170 


14-4682 


1-3412 


0-71 


0-8939 


0-3498 


1-7878 


3-6765 


0-6996 


2-6816 


80449 


1-0493 


3-5756 


14-3021 


1-3991 


0-72 


0-8830 


0-3639 


17669 


3-5318 


0-7277 


2-6489 


7-9466 


1-0916 


3-5318 


141274 


1-4655 


0-73 


0-8713 


0-3775 


1-7426 


3-4852 


0-7541 


2-6139 


7-8417 


1-1326 


3-4852 


13-9408 


1-5101 


0-74 


0-8590 


0-3908 


1-7180 


3-4360 


0-7815 


2-5770 


7-7310 


1-1723 


3-4360 


13-7440 


1-5630 


0-75 


0-8460 


0-4035 


1-6921 


3-3842 


0-8070 


2-6381 


7-6144 


1-2104 


3-3842 


13-5366 


1-6139 


0-76 


0-8323 


0-4157 


1-6646 


3-3293 


0-8313 


2-4970 


7-4909 


1-2470 


3-3293 


13-3171 


1-6626 


0-77 


0-8179 


0-4273 


1-6357 


3-2714 


0-8545 


2-4636 


7-3607 


1-2818 


3-2714 


13-0858 


1-7090 


0-78 


0-8025 


0-4382 


1-6051 


3-2102 


0-8764 


2-4076 


7-2229 


1-3146 


3-2102 


12-8406 


1-7527 


0-79 


0-7865 


0-4484 


1-5729 


3-1459 


0-8969 


2-3594 


7-0782 


1-3453 


3-1459 


12-5835 


1-7938 


0-80 


0-7695 


0-4579 


1-5390 


30780 


0-9159 


2-3085 


6-9255 


1-3738 


3-0780 


12-3120 


1-8317 


0-81 


0-7515 


0-4665 


1-5031 


3-0061 


0-9331 


2-2546 


6-7638 


1-3996 


3-0061 


12-0245 


1-8662 


0-82 


0-7327 


0-4743 


1-4653 


2-9307 


0-9485 


2-1980 


6-5940 


1-4228 


2-9307 


11-7227 


1-8970 


0-83 


0-7129 


0-4810 


1-4257 


2-8515 


0-9619 


21386 


6-4158 


1-4429 


2-8515 


11-4059 


1-9239 


0-84 


0-6921 


0-4866 


1-3842 


2-7683 


0-9732 


2-0762 


6-2287 


1-4698 


2-7683 


11-0733 


1-9464 


0-85 


0-6697 


0-4908 


1-3394 


2-6788 


0-9816 


20091 


6-0273 


1-4725 


2-6788 


10-7152 


1-9633 


0-86 


0-6463 


0-4937 


1-2927 


2-6863 


0-9875 


1-9390 


5-8170 


1-4812 


2-5853 


10-3413 


1-9749 


0-87 


0-6215 


0-4950 


1-2430 


2-4860 


0-9900 


1-8645 


5-5936 


1-4851 


2-4860 


9-9440 


1-9801 


0-88 


0-5953 


0-4946 


1-1907 


2-3813 


0-9892 


1-7860 


5-3580 


1-4838 


2-3813 


9-5253 


1-9784 


0-89 


0-5673 


0-4920 


1-1346 


2-2692 


0-9840 


1-7019 


51056 


1-4760 


2-2692 


9-0766 


1-9680 


0-90 


0-5376 


0-4871 


1-0761 


2-1502 


0-9743 


1-6126 


4-8380 


1-4614 


2-1502 


8-6008 


1-9485 


0-91 


0-6059 


0-4796 


1-0118 


2-0236 


0-9592 


1-5177 


4-6531 


1-4388 


2-0236 


8-0944 


1-9184 


0-92 


0-4718 


0-4687 


0-9435 


1-8871 


0-9374 


1-4153 


4-2459 


1-4061 


1-8871 


7-5483 


1-8748 


0-93 


0-4351 


0-4640 


0-8702 


1-7403 


0-9080 


1-3052 


3-9157 


1-3620 


1-7403 


6-9613 


1-8160 


0-94 


0-3954 


0-4346 


0-7907 


1-5814 


0-8692 


1-1861 


3-6582 


1-3039 


1-5814 


6-3258 


1-7385 


0-95 


03519 


0-4093 


0-7038 


1-4076 


0-8185 


1-0557 


3-1671 


1-2278 


1-4076 


5-6304 


1-6370 


0-96 


0-3036 


0-3759 


0-6073 


1-2146 


0-7618 


0-9109 


2-7328 


1-1277 


1-2146 


4-8582 


1-5036 


0-97 


0-2469 


0-3282 


0-4936 


0-9871 


0-6564 


0-7403 


2-2210 


0-9847 


0-9871 


3-9485 


1-3129 


0-98 


0-1881 


0-2732 


0-3762 


0-7525 


0-5463 


0-5644 


1-6931 


0-8196 


0-7525 


3-0099 


1-0926 


0-99 


0-1127 


0-1854 


0-2254 


0-4508 


0-3708 


0-3381 


1-0142 


0-5561 


0-4508 


1-8030 


0-7415 



* With three oorreotionB, two of which were made by Urban in the Praxis der Kon- 
aUmzmethode, and the third by Rich in Amer. Journ. Psychol. 1918, xxix. 121. The last 
was also rediscovered by the Cambridge Univ. Press proof reader. 
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2. Urban' s Tables for the Constant Process (contd.). 

From Archivf. d. ges. Psychol 1912, xxrv. 240—241. 



p 


5W 


52 r 


5yW 


6 W 


6'W 


6yW 


7W 


VW 


TyW 


0-50 


50000 


25-0000 


0-0000 


60000 


36-0000 


0-0000 


7-0000 


49-0000 


0-0000 


0-51 


4-9989 


24-9945 


0-0885 


5-9987 


35-9921 


0-1062 


6-9985 


48-9892 


0-1239 


0-52 


4-9954 


24-9770 


0-1773 


5-9945 


35-9669 


0-2128 


6-9936 


48-9549 


0-2483 


0-53 


4-9898 


24-9488 


0-2655 


5-9877 


35-9262 


0-3185 


6-9856 


48-8996 


0-3716 


0-54 


4-9819 


24-9095 


0-3537 


5-9783 


35-8697 


0-4251 


6-9747 


48-8266 


0-4960 


0-55 


4-9715 


24-8575 


0-4415 


5-9658 


35-7948 


0-5298 


6-9601 


48-7207 


0-6181 


0-56 


4-9589 


24-7945 


0-5291 


5-9507 


35-7041 


0-6349 


6-9425 


48-5972 


0-7408 


0-57 


4-9439 


24-7195 


0-6165 


5-9327 


35-5961 


0-7398 


6-9215 


48-4502 


0-8631 


0-58 


4-9266 


24-6330 


0-7030 


5-9119 


35-4715 


0-8436 


6-8972 


48-2807 


0-9842 


0-59 


4-9068 


24-5340 


0-7895 


5-8882 


35-3290 


0-9474 


6-8695 


48-0866 


1-1053 


0-60 


4-8842 


24-4212 


0-8753 


5-8611 


35-1666 


1-0503 


6-8380 


47-8656 


1-2254 


0-61 


4-8601 


24-3005 


0-9599 


5-8321 


34-9927 


1-1518 


6-8041 


47-6290 


1-3438 


0-62 


4-8329 


24-1645 


1-0439 


5-7995 


34-7969 


1-2527 


6-7661 


47-3624 


1-4615 


0-63 


4-8036 


240180 


11269 


5-7643 


34-5859 


1-3523 


6-7250 


47-0753 


1-5777 


0-64 


4-7710 


23-8550 


1-2094 


5-7252 


34-3512 


1-4513 


6-6794 


46-7558 


1-6932 


0-65 


4-7363 


23-6815 


1-2906 


5-6836 


34-1014 


1-5488 


6-6308 


46-4157 


1-8069 


0-66 


4-6992 


23-4962 


1-3703 


5-6391 


33-8346 


1-6444 


6-5790 


460526 


1-9184 


0-67 


4-6585 


23-2925 


1-4493 


5-5902 


33-5412 


1-7391 


6-5219 


45-6533 


2-0290 


0-68 


4-6161 


23-0805 


1-5265 


5-5393 


33-2359 


1-8319 


6-4625 


45-2378 


21372 


0-69 


4-5701 


22-8505 


1-6023 


5-4841 


32-9047 


1-9227 


6-3981 


44-7870 


2-2432 


0-70 


4-5213 


22-6065 


1-6765 


5-4256 


32-5534 


20118 


6-3298 


44-3087 


2-3471 


0-71 


4-4694 


22-3470 


1-7489 


5-3633 


32-1797 


2-0987 


6-2572 


43-8001 


2-4484 


0-72 


4-4148 


22-0740 


1-8193 


5-2978 


31-7866 


2-1832 


61807 


43-2650 


2-5471 


0-73 


4-3565 


21-7825 


1-8877 


5-2278 


31-3668 


2-2652 


6-0991 


42-6937 


2-6427 


0-74 


4-2950 


21-4750 


1-9538 


5-1540 


30-9240 


2-3446 


6-0130 


42-0910 


2-7353 


0-75 


4-2302 


21-1510 


2-0174 


5-0762 


30-4574 


2-4209 


5-9223 


41-4560 


2-8243 


0-76 


4-1616 


20-8080 


2-0783 


4-9939 


29-9635 


2-4940 


5-8262 


40-7837 


2-9096 


0-77 


4-0893 


20-4465 


2-1363 


4-9072 


29-4430 


2-5635 


5-7250 


40-0751 


2-9908 


0-78 


4-0127 


20-0635 


2-1909 


4-8152 


28-8914 


2-6291 


5-6178 


39-3245 


3-0673 


0-79 


3-9324 


19-6618 


2-2422 


4-7188 


28-3129 


2-6907 


5-5053 


38-5370 


3-1391 


0-80 


3-8475 


19-2375 


2-2896 


4-6170 


27-7020 


2-7476 


5-3865 


37-7055 


3-2055 


0-81 


3-7576 


18-7882 


2-3327 


4-5092 


27-0551 


2-7993 


5-2607 


36-8250 


3-2658 


0-82 


3-6634 


18-3168 


2-3713 


4-3960 


26-3761 


2-8455 


5-1287 


35-9008 


3-3198 


0-83 


3-5644 


17-8218 


2-4049 


4-2772 


25-6631 


2-8858 


4-9901 


34-9306 


3-3668 


0-84 


3-4604 


17-3020 


2-4330 


4-1525 


24-9149 


2-9196 


4-8447 


33-9119 


3-4062 


0-85 


3-3485 


16-7425 


2-4541 


4-0182 


24-1092 


2-9449 


4-6879 


32-8153 


3-4358 


0-86 


3-2316 


161582 


2-4687 


3-8780 


23-2679 


2-9624 


4-5243 


31-6702 


3-4561 


0-87 


3-1075 


15-5375 


2-4751 


3-7290 


22-3740 


2-9701 


4-3505 


30-4535 


3-4652 


0-88 


2-9766 


14-8832 


2-4730 


3-5720 


21-4319 


2-9676 


4-1673 


291712 


3-4622 


0-89 


2-8364 


14-1822 


2-4600 


3-4037 


20-4222 


2-9521 


3-9710 


27-7972 


3-4441 


0-90 


2-6878 


13-4388 


2-4356 


3-2253 


19-3518 


2-9228 


3-7628 


26-3400 


3-4099 


0-91 


2-5295 


12-6475 


2-3980 


30354 


18-2124 


2-8776 


3-5413 


24-7891 


3-3572 


0-92 


2-3588 


11-7942 


2-3435 


2-8306 


16-9837 


2-8122 


3-3024 


23-1167 


3-2809 


0-93 


2-1754 


10-8770 


2-2700 


2-6105 


15-6629 


2-7240 


30456 


21-3189 


3-1780 


0-94 


1-9768 


9-8840 


2-1731 


2-3722 


14-2330 


2-6077 


2-7675 


19-3726 


3-0423 


0-95 


1-7595 


8-7975 


2-0463 


21114 


12-6684 


2-4556 


2-4633 


17-2431 


2-8648 


0-96 


1-5182 


7-5910 


1-8795 


1-8218 


10-9310 


2-2554 


2-1255 


14-8784 


2-6313 


097 


1-2339 


6-1695 


1-6411 


1-4807 


8-8841 


1-9693 


1-7275 


12-0922 


2-2975 


0-98 


0-9406 


4-7030 


1-3658 


1-1287 


6-7723 


1-6389 


1-3168 


9-2179 


1-9121 


0-99 


0-^634 


2-7172 


0-9269 


0-6761 


3-9568 


11123 


0-7888 


5-4218 


1-2976 
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p 


W 


P 


W 


P 


W 


0-50 


1000 


0-67 


0-932 


0-84 


0-694 


0-51 


1000 


0-68 


0-923 


0-85 


0-670 


0-52 


0-999 


0-69 


0-914 


0-86 


0-646 


0-53 


0998 


0-70 


0-904 


0-87 


0-621 


0-54 


0-996 


0-71 


0-894 


0-88 


0-595 


0-55 


0-995 


0-72 


0-883 


0-89 


0-567 


0-56 


0-992 


0-73 


0-871 


0-90 


0-538 


0-57 


0-989 


0-74 


0-859 


0-91 


0-506 


0-58 


0-985 


0-75 


0-846 


0-92 


0-472 


0-59 


0-981 


0-76 


0-832 


0-93 


0-435 


0-60 


0-977 


0-77 


0818 


0-94 


0-396 


0-61 


0-972 


0-78 


0-803 


0-95 


0-352 


0-62 


0-967 


0-79 


0-787 


0-96 


0-304 


0-63 


0-960 


0-80 


0-770 


0-97 


0-249 


0-64 


0-954 


0-81 


0-752 


0-98 


0187 


0-65 


0-947 


0-82 


0-733 


0-99 


0-112 


0-66 


0-940 


0-83 


0-713 


1-00 


0-000 



The weight oi a. p which is less than 0-5 is the same as the weight of 
a p which exceeds 0*5 by the same amount. Thus the weights oip = 0*25 
and of p = 0-75 are both alike, = 0-846. 

* The table is quoted from F. M. Urban, "The Method of Constant Stimuli and its 
Generalisations," PsychologicalMeview, 1910, xvn. p. 253. See also "Die psychophysischen 
Massmethoden als Criundlagen empirischer Meaaungen," by the same author, Archivf. d. 
get. Psychologie, xv. and xvi. 

4. Reciprocals of pq, where p + q = 1. 





1 




1 




1 


p OT q 


pq 


p or q 


pq 


poiq 


pq 


■50 


4^0 


•67 


4-5 


■84 


7^5 


-51 


4^0 


•68 


46 


■85 


7^9 


•52 


4-0 


•69 


4-7 


■86 


8^3 


•53 


40 


•70 


4^8 


■87 


8^8 


•54 


4-0 


•71 


4-9 


■88 


9-4 


■55 


4^1 


•72 


5^0 


■89 


102 


•56 


4^1 


•73 


51 


■90 


111 


•57 


41 


•74 


52 


■91 


122 


•58 


41 


•75 


53 


■92 


136 


•59 


4^1 


•76 


5^4 


■93 


154 


■60 


4^2 


•77 


56 


■94 


17-7 


•61 


4-2 


•78 


5^8 


■95 


210 


■62 


43 


■79 


60 


■96 


260 


•63 


4-3 


■80 


6^2 


■97 


344 


•64 


4.4 


■81 


6-5 


•98 


51^0 


•65 


4.4 


•82 


6^8 


•99 


101 


•66 


4^5 


•83 


7^1 


1-00 
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5. Rich's Checking Table for the Constant Method. 

Published from the Laboratory of Cornell University, in 
Amer. Journ. Psychol. 1918, xxix. 120. 

An example will best explain the use of this Table. Consider the 
example worked on p. 73. If we form for each hne the totals of the 
five quantities 

we get the results 



+ yT 


V + sW + 


sW + SyW 


s 


p 


Totals 


-6 


•00 


•0000 


-5 


•10 


132371 


-4 


•14 


9-8835 


-3 


•40 


7-1880 


-2 


■65 


2-5836 





•80 


12274 


2 


•87 


6^8355 


4 


•96 


8-2559 


6 


1^00 


00000 



48-2110 

These totals however will all be found in Eich's Table entered with 
the proper s (Rich's x) and p, and can thus be checked. The grand total 
48'2110 ought to agree with the sum of the last row of the table (4) on 
p. 73, i.e. with 

4-8026 + 0-4311 - 7-6406 + 43-7049 + 6-9130. 



198 



APPENDIX I 



Rich's Checking Table. 



p 


x = l 


x=2 


a;=3 


X=4: 


x=S 


x=6 


a;=7 


•01 


- 0327 


-2327 


■7235 


14396 


2-2810 


3-4479 


48403 


•02 


•0179 


-4973 


13529 


2^5847 


4-1927 


6-1770 


85375 


•03 


•0843 


-7430 


18953 


3-5414 


5-6810 


8-3142 


11-4409 


■04 


•1590 


•9978 


24437 


4^4969 


7-1574 


10-4251 


14-3003 


•06 


•2371 


12355 


29376 


53436 


8-4533 


12-2668 


16-7842 


•06 


■3170 


14637 


34012 


61296 


9-6485 


13-9583 


19-0686 


•07 


■3973 


1-6836 


38400 


6^8667 


10-7635 


15-5305 


21-1676 


•08 


■4780 


1-8963 


42582 


76637 


11-8126 


17-0062 


23-1413 


•09 


■5585 


2-1025 


4^6583 


8^2259 


12-8053 


18-3965 


24-9995 


•10 


•6386 


2-3016 


50397 


8^8530 


137415 


19-7048 


26-7434 


•11 


•7179 


2-4951 


5^4068 


94531 


14-6339 


20-9491 


28-3994 


•12 


•7967 


2-6835 


5^7609 


100289 


15-4875 


221370 


29-9770 


•13 


•8745 


2-8655 


6^0994 


10^5764 


16-2964 


232594 


31-4653 


•14 


■9515 


3-0431 


6-4274 


11-1043 


170737 


243361 


32-8910 


•15 


10275 


3-2155 


6-7428 


11-6096 


178168 


263614 


34-2463 


•16 


11031 


3-3848 


7-0606 


12-1007 


186349 


263533 


35-5559 


•17 


11767 


3-5472 


7-3434 


12-5654 


192132 


27^2864 


36-7858 


•18 


12495 


3-7059 


7-6276 


13-0148 


198673 


28^1850 


37-9681 


•19 


13215 


3-8611 


7-9038 


13-4494 


204981 


290500 


391049 


•20 


13927 


40127 


8-1718 


13-8699 


211070 


29-8830 


40-1981 


•21 


1-4627 


41600 


8-4304 


14-2737 


216901 


30-6791 


41-2413 


•22 


1-5311 


4-3032 


8-6802 


14-6624 


22^2496 


31-4418 


42-2393 


•23 


1-5991 


4-4432 


8-9231 


15-0388 


227901 


32-1773 


43-1999 


•24 


1-6655 


4-5792 


91575 


16-4004 


23-3079 


32-8800 


44-1169 


•25 


1-7310 


4-7118 


9-3846 


16-7494 


23-8063 


33-5552 


449965 


■26 


1-7954 


4-8407 


9-6039 


160852 


24-2844 


34-2016 


45-8369 


•27 


1-8589 


4-9675 


9-8168 


16-4097 


24-7451 


34-8232 


46-6439 


•28 


1-9212 


50891 


10-0230 


16-7228 


26-1886 


36-4203 


47-4177 


•29 


1-9821 


5-2078 


10-2213 


17-0226 


26-6116 


35-9884 


481530 


•30 


20423 


5-3239 


10-4142 


17-3130 


26-0203 


36-6362 


48-8604 


•31 


2-1010 


6-4367 


10-6004 


17-5921 


26-4118 


37-0596 


49-5354 


•32 


21590 


6-5466 


107807 


17-8611 


26-7880 


37-5612 


50-1810 


■33 


22153 


6-6523 


109526 


18-1164 


27-1435 


380341 


50-7880 


■34 


22712 


5-7567 


111217 


18-3665 


27-4908 


38-4960 


51-3789 


■35 


2-3257 


5-8564 


112819 


18-6019 


27-8164 


38-9264 


51-9288 


•36 


2-3788 


6-9637 


11 4370 


18-8287 


28-1289 


39-3374 


52-4543 


•37 


2-4313 


6-0488 


11 ■5878 


19-0482 


28-4300 


39-7332 


52-9579 


•38 


2-4822 


6-1397 


11^7304 


19-2543 


28-7113 


40-1015 


53-4248 


■39 


2-5320 


6-2282 


lb8684 


19-4525 


28-9807 


40-4530 


53-8693 


•40 


2-5804 


6-3128 


119988 


19-6386 


29-2319 


40-7792 


54-2800 


•41 


2-6284 


6-3968 


121261 


19-8191 


29-4748 


410933 


54-6743 


•42 


2-6747 


6-4754 


122468 


19-9887 


29-7013 


41-3845 


550384 


•43 


2-7198 


6-5616 


12^3609 


20-1479 


29-9124 


41-6545 


55-3741 


•44 


2-7638 


6-6251 


124698 


20-2983 


30-1103 


41-9059 


55-6849 


45 


2-8063 


6-6952 


125727 


20-4388 


30-2935 


42-1368 


55-9687 


•46 


2-8478 


6-7625 


12^6700 


20-5703 


30-4634 


42-3486 


56-2310 


•47 


2-8878 


6-8264 


12-7610 


20-6915 


30-6180 


42-5403 


56-4585 


•48 


2-9263 


6-8872 


12-8461 


20-8033 


30-7587 


42-7122 


56-6638 


•49 


2-9640 


6-9454 


12-9263 


20-9069 


30-8870 


42^8667 


56-8459 


•50 


30000 


7^0000 


130000 


21-0000 


31-0000 


430000 


670000 
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p 


x = \ 


x = i 


a; = 3 


a;=4 


x=6 


x = Q 


x=l 


■51 


30348 


7-0516 


13-0679 


2b0839 


31-0994 


43-1145 


571291 


■52 


30683 


7-1000 


13-1299 


21-1581 


31-1843 


43-2088 


57^2314 


■53 


3'1002 


7-1450 


13^1858 


212225 


31-2552 


43-2835 


57^3079 


■54 


31306 


7^1869 


132358 


21-2777 


31-3122 


43-3402 


57-3644 


•55 , 


31595 


7^2250 


132791 


21-3218 


31-3531 


43-3730 


573815 


■56 


31870 


7-2599 


13-3164 


21-3565 


31-3801 


43-3873 


57^3781 


■57 


32130 


7-2914 


13^3473 


21-3809 


31-3920 


43-3807 


57^3469 


■58 


32371 


7-3190 


13-3716 


21-3947 


31-3885 


433529 


57^2880 


■59 


32600 


7-3432 


13-3893 


21-3981 


31-3696 


433039 


57^2007 


•60 


3-2804 


7-3630 


13-3992 


21-3890 


31-3325 


43-2298 


57^0808 


■61 


33000 


7-3800 


13-4042 


213723 


31-2845 


431406 


56^9409 


■62 


33174 


7-3925 


13-4006 


213421 


31-2167 


430245 


56^7654 


■63 


33329 


7-4012 


13-3910 


21-3020 


31-1346 


42^8886 


565641 


■64 


3-3464 


7^4051 


13-3722 


21-2477 


31-0315 


42^7238 


563245 


■65 


33581 


7-4052 


13-3469 


21-1831 


30-9138 


425392 


56^0588 


■66 


33676 


7-4011 


13-3143 


21-1071 


30-7796 


423320 


557639 


■67 


3-3749 


7-3915 


13-2716 


21-0150 


30-6219 


420921 


554258 


■68 


3-3802 


7-3784 


13-2231 


20-9141 


30-4516 


418356 


55^0660 


■69 


3-3830 


7-3595 


13-1642 


20-7967 


30-2574 


41^5460 


54^6628 


■70 


3-3835 


7-3357 


13-0966 


20-6660 


30-0439 


412304 


54^2252 


■71 


3-3813 


7-3066 


13-0195 


20-5204 


29-8090 


40^8854 


537494 


•72 


3-3768 


7-2723 


12-9340 


20-3616 


29-5550 


405145 


53-2397 


•73 


3-3689 


7^2307 


12-8370 


20-1849 


29-2755 


401086 


52-6843 


•74 


3-3586 


7^1853 


127301 


19-9928 


28-9736 


396724 


52-0891 


•75 


3-3450 


7-1328 


126124 


19-7842 


28-6481 


392040 


51-4521 


•76 


3-3283 


7-0732 


12-4829 


19-5570 


28-2959 


38-6994 


50-7675 


•77 


33083 


7-0068 


12-3413 


19-3114 


27-9173 


38-1589 


50-0361 


•78 


32839 


6-9324 


12^1858 


19-0442 


27-5078 


37^6764 


49-2503 


•79 


32563 


6-8506 


12^0178 


18-7581 


27-0713 


369573 


484163 


•80 


32243 


6^7603 


11-8352 


18-4491 


26-6020 


362940 


47^5249 


•81 


31875 


6-6603 


116360 


18-1148 


26-0965 


35^5816 


46^5695 


•82 


3-1467 


6-5515 


11-4218 


17-7574 


25-5585 


34^8246 


455563 


•83 


3-1007 


64330 


111912 


173752 


24-9850 


34^0200 


44^4814 


•84 


30495 


6^3044 


109434 


169667 


24-3741 


331657 


433415 


•85 


2-9907 


6^1603 


10^6694 


16^5178 


237056 


322328 


420995 


•86 


2-9263 


6-0055 


10-3772 


160415 


22-9985 


31-2483 


40^7906 


•87 


2-8545 


5-8355 


100596 


155266 


222366 


30-1896 


393857 


•88 


2-7751 


5-6511 


9-7177 


149749 


21^4227 


29-0614 


37^8906 


•89 


2-6859 


5-4471 


9-3428 


143731 


205379 


27-8373 


362716 


■90 


2-5870 


5^2243 


8-9367 


13^7242 


195869 


26-5246 


345374 


■91 


2-4769 


4-9801 


8-4951 


13-0219 


18-5605 


25-1109 


326731 


■92 


2-3528 


4-7085 


8-0078 


12-2507 


174370 


23-5670 


306405 


•93 


2-2133 


4-4076 


7-4720 


11-4067 


162115 


21-8865 


28^4316 


■94 


2-0554 


4-0713 


6-8782 


10-4757 


148639 


200429 


26^0124 


•95 


1-8743 


3-6911 


6-2118 


9-4362 


133645 


17-9966 


233324 


'96 


1-6626 


3-2532 


5-4509 


8-2559 


11^6682 


15^6877 


20^3147 


•97 


1-3971 


2-7122 


45211 


6-8236 


96196 


129092 


166923 


•98 


1-1107 


2-1363 


35383 


5-3163 


7-4707 


100012 


129081 


•99 


-7089 


1-3451 


2-2065 


3-2934 


4-5056 


60433 


7^8063 
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V 


a:=-7 


^=-6 


x=-5 


x=-i 


x=-Z 


x=-i 


x=-l 


a;=0 


■01 


5-8579 


4-3203 


3-0080 


2-0210 


1-1595 


•6235 


•1127 


- -0727 


•02 


97281 


7-1974 


5-0431 


3-2649 


1-8631 


•8375 


-1881 


-■0851 


•03 


125809 


92914 


6-4954 


41930 


2-3841 


10686 


•2469 


-0813 


•04 


15-3119 


112923 


7-8800 


5-0749 


2-8773 


b2868 


•3036 


- 0723 


•06 


175872 


129552 


9-0269 


5-8024 


3-2818 


1^4649 


•3619 


-•0674 


•06 


19-6082 


144293 


100411 


6-4437 


3-6368 


1-6207 


•3954 


-0392 


•07 


21-4324 


15-7575 


109527 


7-0181 


3-9536 


1-7692 


•4361 


-0189 


•08 


23-0983 


16^9684 


11^7820 


7-5391 


4-2398 


1-8841 


•4718 


•0031 


•09 


24-6313 


18^0809 


12-5423 


8-0155 


4-5005 


1-9973 


•5059 


•0263 


•10 


26-0376 


190998 


13-2371 


8-4496 


4-7373 


2-0999 


•5376 


•0505 


•11 


27-3456 


200459 


138811 


8-8507 


4-9550 


21939 


•5673 


■0753 


•12 


28-5668 


20^9282 


14-4803 


9-2231 


5-1565 


2-2805 


•5953 


■1007 


■13 


29-6947 


217416 


15-0316 


9-5646 


53406 


2-3595 


-6216 


■1265 


•14 


30-7546 


22-5049 


15-5479 


9-8835 


5^5118 


2-4327 


•6463 


■1526 


•15 


31-7421 


23-2148 


16-0270 


10-1786 


5-6696 


2-4999 


•6697 


•1789 


•16 


32-6789 


23-8875 


16-4801 


104569 


5-8178 


2-5628 


•6921 


•2056 


•17 


33-5392 


24-5036 


16-8942 


107102 


5-9520 


2-6196 


•7129 


■2319 


•18 


34-3503 


25-0840 


17-2831 


10^9474 


6-0772 


2-6723 


•7327 


■2584 


•19 


35-1151 


25-6302 


17-6483 


111696 


6-1938 


2-7211 


•7615 


■2850 


•20 


35-8361 


261442 


17-9912 


113773 


6-3024 


2-7665 


•7695 


■3116 


•21 


36-5089 


26-6229 


18-3097 


115695 


6^4022 


2-8080 


•7865 


•3381 


■22 


37-1383 


27^0696 


18-6060 


117474 


6-4942 


2-8458 


•8025 


•3643 


■23 


37-7315 


27^4899 


18-8841 


U-9140 


6-6795 


2-8808 


•8179 


■3906 


■24 


38-2837 


27^8802 


19-1413 


120670 


6-6676 


2-9126 


•8323 


-4166 


•25 


38-8005 


282446 


19-3807 


12^2088 


6^7292 


2-9416 


•8460 


•4426 


•26 


39-2815 


28^5828 


19-6020 


12-3392 


6-7946 


2-9677 


•8690 


•4682 


•27 


39-7311 


28^8980 


19-8075 


124595 


6-8542 


2-9905 


•8713 


•4938 


•28 


40-1505 


291911 


19-9976 


12-5702 


6-9084 


30127 


•8830 


•6191 


•29 


40-5354 


294592 


20-1706 


12-6698 


6-9567 


30314 


•8939 


•5441 


•30 


40-8950 


29^7086 


20-3307 


12-7614 


7-0004 


3^0481 


•9043 


•5690 


•31 


41-2256 


29-9368 


20-4762 


12-8435 


7-0390 


30625 


•9140 


•5935 


•32 


41-5304 


30-1464 


20-6088 


12-9177 


7-0731 


30750 


•9232 


•6179 


•33 


41-8022 


30-3319 


20-7251 


12-9816 


7-1016 


3^0849 


•9317 


•6418 


•34 


42-0577 


30-5056 


20-8330 


13-0401 


7-1269 


30936 


•9398 


•6657 


•35 


42-2810 


30-6558 


20-9250 


13-0889 


71471 


3-1000 


•9473 


•6892 


■36 


42-4819 


30-7896 


21-0057 


13-1303 


7-1632 


3-1046 


•9542 


•7123 


•37 


42-6633 


30-9092 


21-0766 


13-1654 


7-1758 


3-1076 


•9607 


•7353 


•38 


42-8156 


310079 


21-1333 


13-1919 


7-1836 


3-1085 


•9666 


•7578 


•39 


42-9487 


3b0924 


21-1803 


13-2121 


7-1880 


3-1080 


•9720 


•7800 


•40 


430548 


311576 


21-2141 


13-2242 


7-1880 


3-1056 


•9768 


•8018 


•41 


43-1459 


312117 


21-2402 


13-2315 


7-1853 


3-1020 


•9814 


•8235 


■42 


43-2124 


31^2479 


21-2541 


13-2309 


7-1784 


3-0966 


•9853 


-8447 


■43 


43-2573 


31-2687 


21-2576 


13-2241 


7-1681 


3-0896 


•9888 


-8655 


■44 


43-2815 


312743 


21-2507 


13-2107 


7-1542 


3-0811 


•9918 


-8860 


•45 


43-2847 


31^2648 


21-2335 


13-1908 


7-1367 


3-0712 


•9943 


-9060 


•46 


43-2736 


31-2422 


21-2070 


13-1653 


7-1162 


3-0599 


•9964 


-9257 


•47 


43-2305 


31-2019 


21-1694 


13-1327 


7-0920 


3-0470 


•9980 


-9449 


•48 


431732 


311488 


21-1225 


130945 


7-0645 


30326 


•9991 


■9636 


•49 


43-0967 


31-0817 


21-0662 


13-0503 


7-0339 


3-0170 


•9998 


■9821 


■50 


430000 


310000 


21-0000 


13-0000 


7-0000 


3-0000 


1-0000 


10000 
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Checking Table. 








p 


x=-l 


x=-% 


x=-5 


X=-4: 


x=-3 


x=-2 


x=-l 


x=0 


•51 


42-8843 


30-9047 


20-9246 


12-9441 


6-9631 


2^9816 


-9998 


1^0175 


•52 


42-7476 


30-7942 


20-8389 


12-8817 


6-9227 


2-9618 


•9991 


10346 


•53 


42-5935 


30-6711 


20-7446 


12-8141 


6-8796 


2-9408 


•9980 


10511 


•54 


42-4230 


30-5334 


20-6410 


12-7407 


6-8332 


2-9183 


•9964 


10671 


•55 


42-2251 


30-3818 


20-5271 


12-6610 


6-7835 


2-8946 


•9943 


10826 


•56 


420115 


30-2161 


20-4041 


12-5757 


6-7308 


2-8695 


•9918 


1^0976 


•57 


41^7777 


30-0357 


20-2712 


12-4843 


6-6749 


2-8430 


•9888 


11121 


•58 


41-5252 


29-8419 


20-1293 


12-3873 


6-6160 


2-8154 


•9853 


11259 


•59 


41-2511 


29-6327 


19-9770 


12-2841 


65537 


2-7862 


•9814 


11393 


•60 


40-9540 


29-4070 


19-8135 


12-1738 


6^4876 


2-7554 


•9768 


1-1518 


•61 


40-6451 


29-1728 


19-6445 


12-0603 


6^4202 


2-7242 


•9720 


1-1640 


•62 


40-3102 


28-9201 


19-4631 


11-9393 


6-3486 


2-6909 


•9666 


11754 


•63 


39-9587 


28^6554 


19-2736 


11-8132 


6-2742 


2-6568 


•9607 


1^1861 


•64 


39-5793 


28^3708 


19-0707 


11-6789 


6-1956 


2-6207 


•9542 


11961 


•65 


39-1834 


28^0744 


18-8600 


ir5401 


6-1145 


2-5836 


•9473 


1^2054 


•66 


38-7691 


27^7650 


18-6406 


113959 


6-0307 


2-5455 


-9398 


12139 


•67 


38-3240 


274335 


18-4063 


11^?426 


5-9422 


2-5053 


-9317 


12216 


•68 


37-8666 


27^0932 


18-1664 


11 •0859 


5-8519 


2-4644 


-9232 


1^2286 


•69 


37-3802 


267324 


17-9126 


10-9209 


5-7572 


2-4217 


-9140 


12345 


•70 


36-8714 


26^3556 


17-6483 


10-7496 


5-6592 


2-3775 


-9043 


1^2396 


•71 


36-3382 


259614 


17-3724 


10-5712 


6-5577 


2-3318 


-8939 


12437 


•72 


357841 


255525 


17-0868 


10-3870 


5-4530 


2-2851 


•8830 


12469 


•73 


35-2007 


251226 


16-7871 


10-1943 


5-3440 


2-2373 


■8713 


1^2488 


•74 


34-5925 


24^6752 


16-4760 


9-9948 


5-2315 


2-1863 


-8590 


12498 


•75 


339589 


24^2098 


16-1529 


9-7880 


5-1164 


21346 


•8460 


12495 


•76 


332959 


237236 


15-8161 


9-5732 


4-9949 


20814 


•8323 


1^2480 


•77 


32^6045 


23-2175 


15-4661 


9-3506 


4-8705 


20264 


•8179 


12452 


•78 


318801 


22-6878 


151006 


9-1184 


4-7414 


1-9694 


•8025 


12407 


•79 


31^1275 


22-1383 


14-7221 


8-8787 


4-6084 


1-9110 


•7865 


1-2349 


•80 


303409 


21-5648 


14-3278 


8-6297 


4-4706 


1-8505 


•7696 


1-2274 


•81 


29-5165 


20-9646 


13-9159 


8-3702 


4-3276 


1-7879 


•7515 


1-2180 


•82 


28-6593 


20-3416 


13-4891 


8-1020 


4-1802 


1-7239 


•7327 


1-2070 


•83 


27-7676 


19-6940 


13-0464 


7-8244 


4-0282 


1-6578 


•7129 


11939 


•84 


26-8397 


19-0215 


12-5873 


7-5373 


3-8714 


1-5896 


•6921 


1-1787 


•85 


25-8521 


18-3066 


12-1004 


7-2336 


3-7062 


1-5183 


•6697 


1-1605 


•86 


24-8298 


17-5675 


11-5979 


6-9211 


3-5368 


1-4451 


•6463 


11400 


•87 


23-7543 


16-7914 


11-0714 


6-5944 


3-3604 


1-3695 


-6215 


11165 


•88 


22-6316 


15-9822 


10-5235 


6-2555 


3-1781 


1-2913 


-5953 


1-0899 


•89 


21-4414 


15-1257 


9-9451 


5-8987 


2-9870 


1-2099 


-5673 


1-0593 


•90 


20-1920 


14-2284 


9-3401 


5-5268 


2-7887 


1-1255 


■5376 


1-0247 


•91 


18-8761 


13-2849 


8-7055 


5-1379 


2-5821 


1-0381 


■5059 


•9855 


•92 


17-4739 


12-2814 


8-0324 


4-7269 


2-3650 


-9467 


■4718 


•9405 


•93 


15-9844 


11-2175 


7-3207 


4-2941 


2-1376 


•8512 


■4351 


•8891 


•94 


14-3928 


10-0831 


6-5641 


3-8359 


1-8982 


•7515 


■3954 


•8300 


•95 


12-6762 


8-8626 


5-7529 


3-3470 


1-6448 


•6465 


•3519 


•7612 


•96 


10-8011 


7-5333 


4-8728 


2-8195 


1-3737 


•5350 


•3036 


•6795 


•97 


8-6423 


6-0092 


3-8696 


2-2236 


10711 


•4122 


•2469 


•5751 


•98 


6-4503 


4-4660 


2-8579 


1-6261 


■7705 


•2913 


•1881 


•4613 


•99 


36335 


2-4665 


1-5250 


-9088 


•4181 


•1527 


•1127 


•2981 
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• zero. 



D=\ z^e-^dz = ^, i = zero. 

Jo J -00 

^ = j^ z»e-^ dz = - 2. 2. 2 ■■■2 2 ^"^ ^''^"^ 
1.2.4.6...(M-1), ,,. 
"'^ 2.2.2.2...2 (''"•^•*)- 

F= r t^g _ TT p 7T 
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2 ■ 
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but I is not 
Jo 



2' 



5=f e-'-^+^^+«^''rf2 = :^e--^, but (""isnor 
J-oo vV .'o 

if Pg > i?^ but f °° f " is not i . 

JOJO 4: 

A = e zrfz = T- e Q , but I is no< -^ . 

J -00 0* Jo 2 
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/•oo /-oo y 

but I is woi T . 
Jo Jo 4 



2{PQ-R^f' 



Note. In the above integrals P and Q do not, as often, represent 
constants whose sum is unity. R is not a correlation coefficient, although 
it is closely connected therewith. 



APPENDIX III 

NOTATION 

A simple, consistent, and easily remembered notation is an essential 
in any mathematical treatment. The chief difficulty in the way of 
obtaining consistency in a work like the present is the desire naturally 
felt to keep any piece of research in the notation used by its originator. 
But a great efEort has been made to attain a notation which could be 
used throughout the book without too seriously altering the appearance 
of any well-known formula. Most of the symbols in the following 
Table have been used by several important writers with the meaning 
here indicated. They are either always used with this meaning: or if 
there be any exception it is hoped that the context will make clear what 
is meant. 

The greatest difficxilty has been felt over the choice of a symbol 
for a stimulus value. The commonest symbols hitherto used have been 
R and S for stimulus and sensation intensity respectively, and r has 
been used instead of jR by at least one very important writer. But 
there is still greater unanimity of usage in employing r and R for correla- 
tion coefficients. In adopting small s for stimulus, we realise that there 
is some danger of confusion with great S used in the past, by writers 
on Weber's Law, for sensation intensity. But some of the same writers 
also use S for a standard stimulus (as we do) : and after the first chapter 
we have no necessity to speak of sensation-intensities at all, it is frankly 
physical stimuli which are measured. In the first chapter itself ambiguity 
or misunderstanding is avoided by printing the words at full length. 

It is hoped that this list of symbols may assist in obtaining unanimity 
and consistency in the notation of the subject and serve as a convenient 
reference. 
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Symbols Used 



a a mean, m is also used foi a mean, and any quantity with a bar, as x, is a mean. 

A a true value. 

Oj, Oj, a, readings of which a is the mean. 

b a coefficient of regression. Usually suffixes are added. 

c and C constants. Je is also used in this sense. 

Cj , Cg , and ^ contingency symbols. 

S a deviation: aJso used for a finite increment, d indicating an infinitesimal 

increment. 
e an error. 

h modulus of precision in one notation of the Gaussian or Normal Curve of Error. 

h a constant. Also c and C. See also n. 

I see n. 

m a mean. See also n. 

M Miiller's weights. See to. 

H and V momenta. 
N see n. 

n a number (of experiments, or the like). Where necessary, N, and also k, I, m, 

are used in this sense, m more commonly is used for a mean. 
p a mathematical measure of probability. 

P same as p. 

Q same as p. Usually Q ia 1-P: also used, usually with a suffix, for a quartile. 

q same as p. Usually q ial -p. 

r a correlation coefficient. 

7J a correlation coefficient. 

7) a correlation ratio. 

s a stimulus value. Siaa, standard stimulus, V a variable, s is also on one occasion 

used for a standard deviation. 
<r a standard deviation. Also 2 is used, and on one occasion a. 

S a standard deviation. Would also be used for a summation of any kind other 

than that for which S is used in this book; but no such occasion arises in 

the present work. 
iS a sum over the distribution, e.g. S (e) is the sum of the errors. 8 is also used for 

a standard stimulus, see small s and also the remarks preceding this table. 
t t, with X, y and z, is used as a variable of integration. 

T Thresholds, both absolute and difference. Sometimes with suffixes as T„ and 

Tj for upper and lower difEerence thresholds or limina. 
u u and v "residuals" in observation equations, 

t; see u. 

V variable stimulus, see s. Also coefficient of variation. 

w "weight" of a reading or an equation. Also W, and M, the latter for Miiller's 

weights only. 
W see w. 

Y in connection with goodness of fit. 

X X, y, z and t are used as variables of integration. 

y gee X. 

z see X. 
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engaged in the work of research, dealing with the various subjects 
that come within the field of Psychology. The extent of this field 
may be gauged from the titles of the volmnes given below, some of 
which have already appeared, the others being in preparation. 

For the reasons indicated, the Cambridge Psychological Library 
will appeal not only to Psychologists but also to students of many 
other departments of scientific thought and investigation, such as 
Philosophy, Sociology, Art, Comparative Religion, Anthropology, 
Physiology, Zoology, Medicine, Education and Industrial Efficiency. 

[p. T. o. 



The following volumes have been arranged: 

Psychological Principles. 

By Prof. James Ward, Sc.D., F.B.A. 
Royal 8vo. 2nd edition. 24s net. [Now ready 

An Introduction to the Study of Colour Vision. 

By J. Herbert Parsons, D.Sc. F.R.C.S. 

Royal 8vo. Frontispiece and 75 text-figures. 12s 6d net. [Now ready 

The Group Mind. 

By W. McDouGALL, F.R.S. 
Royal 8vo. 21s net. [Now ready 

Mental Measurement. 

By W. Brown, D.Sc. and G. H. Thomson, D.Sc, Ph.D. 

[Now ready 

Prolegomena to Psychology. 

By Prof. G. Dawes Hicks, M.A.. Ph.D., Litt.D. 

Psychology in Relation to Theory of Knowledge. 

By Prof. G. F. Stout, M.A., LL.D.. F.B.A. 

The Nervous System. 

By Prof. C. S. Sherrington, M.D., F.R.S. 

The Structure of the Nervous System and the Sense Organs. 
By Prof. G. Elliot Smith, M.D., F.R.S. 

The Psychology of Mental Differences. 

By C. Burt, M.A. 

The Psychology of Personality and Suggestion. 
By T. W. Mitchell, M.D. 

The Psychology of Dreams. 

By Prof. T. H. Pear, M.A. 

Comparative Psychology. 

By Prof. Carveth Read, M.A. 

The Process of Conventionalisation. 
By F. C. Bartlett, M.A. 
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